GitHub - jorredahl/LinguisticCS

Phonetics Visualizer

A minimal app for visualizing and comparing phonetic audio samples using waveforms and spectrograms. A user can compare two waveforms and examine differences in their waveform and spectrogram to analyze their pronounciation of words when learning a language.
Built using C++ and Qt.

📝 Table of Contents

🧐 About

Write about 1-2 paragraphs describing the purpose of your project.

This project was developed as a final project for Middlebury College's CS318: OOP & GUI Development. The original concept of the Phonetics Visualizer is to aid language learners' pronounciation by algorithmically identifying differences in pronounciation between an uploaded and a recorded sentence. These could be differences in length of phoneme, pitch, relative emphasis, et cetera. We believe that identifying these differences for the listener, then giving them capability to isolate, listen, and visualize these shortcomings in pronounciation would significantly enhance the language learning process.

Although over the short course of the semester we were not able to implement an algorithmic framework to recognize differences in these across recordings, our team developed visualization and playback capabilities that allow for user analysis.

🏁 Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Prerequisites

What things you need to install the software and how to install them.

An instance of Qt and Qt Creator. This is for the development version.
1. This requires XCode if you are on a Mac.
Right now the software is dependent on the FFTW library. We have been running it from within our Qt development instance, which is dynamically linking to the FFTW headers from our \usr\local\lib directory. Hopefully for the production version we will have this code fully linked and included in a standalone executable.

Installing

A step by step series of examples that tell you how to get a development env running.

Install XCode, Qt, Qt Creator. There are plenty of better guides to do this online than anything we could write.
Git clone the project into your desired directory.

git clone https://github.com/jorredahl/LinguisticCS.git

Install FFTW. We did this using homebrew, then to get the FFTW package to where our .pro file was looking we used

sudo cp /opt/homebrew/Cellar/fftw/3.x.x/lib/* /usr/local/lib/

🎈 Usage

Upon starting the Phonetics Visualizer, the user will be greeted with some controls and a blank waveform visualizer. Almost every control is initially disabled - you have to first upload an audio file.

Loading audio files

The first step is to load an audio file to compare.

Click the top "Upload" button to upload a source file to compare your audio against.
There will be a pop-up file explorer. Navigate in your computer directory to the desired audio file and click "Open" in the bottom right to open that.

Note: only .wav files are accepted at the moment. A 24 or 32 bitrate is encouraged for the file.

Please see the User Documentation PDF and the User Documentation PDF for additional details.

⛏️ Built Using

C++
Qt
FFTW

✍️ Authors

See also the list of contributors who participated in this project.

🎉 Acknowledgements

Thank you to Professor Swenton for the plethora of help throughout the semester!
Thank you to Professor Baird for the linguistics resources and Professor Abe for the Japenese resources!

Name		Name	Last commit message	Last commit date
Latest commit History 151 Commits
japanese-audios		japanese-audios
resources/icons		resources/icons
.gitignore		.gitignore
.gitignore.save		.gitignore.save
Developer_Documentation.pdf		Developer_Documentation.pdf
Phonetics.pro		Phonetics.pro
README.md		README.md
User_Documentation.pdf		User_Documentation.pdf
audio.cpp		audio.cpp
audio.h		audio.h
main.cpp		main.cpp
mainwindow.cpp		mainwindow.cpp
mainwindow.h		mainwindow.h
resources.qrc		resources.qrc
segmentgraph.cpp		segmentgraph.cpp
segmentgraph.h		segmentgraph.h
spectrograph.cpp		spectrograph.cpp
spectrograph.h		spectrograph.h
waveformsegments.cpp		waveformsegments.cpp
waveformsegments.h		waveformsegments.h
wavfile.cpp		wavfile.cpp
wavfile.h		wavfile.h
wavform.cpp		wavform.cpp
wavform.h		wavform.h
zoom.cpp		zoom.cpp
zoom.h		zoom.h

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Phonetics Visualizer

📝 Table of Contents

🧐 About

🏁 Getting Started

Prerequisites

Installing

🎈 Usage

Loading audio files

Please see the User Documentation PDF and the User Documentation PDF for additional details.

⛏️ Built Using

✍️ Authors

🎉 Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Phonetics Visualizer

📝 Table of Contents

🧐 About

🏁 Getting Started

Prerequisites

Installing

🎈 Usage

Loading audio files

Please see the User Documentation PDF and the User Documentation PDF for additional details.

⛏️ Built Using

✍️ Authors

🎉 Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages