renee_thesis

All the code for my thesis on improving speech recognition systems for children. I am currently working on using Facebook AI's wav2vec 2.0.

Steps for kaldi:

Make sure Kaldi is installed before proceeding.

Download all the required datasets.
In s5/run.sh, modify [DATASET NAME]_ROOT to point to the main directory of the dataset.
Run ./run.sh. This will build the HMM-GMM model.
After it has completed successfully, run local/nnet3/run_tdnn_delta.sh for the TDNN model.

The script s5/clean.sh will remove the files created by s5/run.sh so that you can train the models again.
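
The steps above can be sketched as a small wrapper. This is only a sketch: the function name run_kaldi_recipe is hypothetical, it assumes the [DATASET NAME]_ROOT variables in s5/run.sh have already been edited by hand, and it assumes local/nnet3/run_tdnn_delta.sh is meant to be run from inside s5.

```shell
#!/usr/bin/env bash
# Sketch: run the HMM-GMM recipe, then the TDNN recipe only if the first one succeeded.
# Assumes the [DATASET NAME]_ROOT variables in s5/run.sh were already edited by hand.

run_kaldi_recipe() {
    local s5_dir="${1:-s5}"   # path to the s5 directory (hypothetical default)

    if [ ! -f "$s5_dir/run.sh" ]; then
        echo "run.sh not found in $s5_dir" >&2
        return 1
    fi

    (
        cd "$s5_dir" || exit 1
        # Build the HMM-GMM model; stop here if it fails.
        ./run.sh || exit 1
        # Train the TDNN model once the HMM-GMM stage has finished.
        local/nnet3/run_tdnn_delta.sh
    )
}

# Usage (not executed here): run_kaldi_recipe /path/to/s5
```

Running both stages inside a subshell keeps the caller's working directory unchanged, and the TDNN stage is skipped if run.sh exits with an error.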

Steps for wav2vec 2.0: [WIP]

Make sure PyTorch and fairseq are installed before proceeding.

In s5/wav2vec_projects, run the various run_* scripts.

Install Kaldi:

Refer to: https://kaldi-asr.org/doc/tutorial_setup.html

git clone https://github.com/kaldi-asr/kaldi.git
Look at the kaldi/INSTALL file and follow the instructions there
Download SRILM by running kaldi/tools/install_srilm.sh

Install PyTorch & fairseq:

PyTorch: https://pytorch.org/get-started/locally/
fairseq: https://www.folio3.ai/blog/fairseq/

Initialising katana:

To execute the steps on the supercomputer katana:

Log in. In Windows, use PuTTY with host name = katana1.restech.unsw.edu.au and log in using your zID and password. In Linux, run ssh zID@katana1.restech.unsw.edu.au in a terminal, or use the alias katana.
Create a new screen using screen -S nameOfSession (the screen I'm using is called 'thesis')
Request an interactive GPU node using qsub -I -l select=1:ngpus=1:ncpus=8:mem=46gb,walltime=12:00:00. Once the node is ready, you are placed inside it and the terminal prompt will show (zID@kxxx), where kxxx is your node.
Now you are inside the screen, and inside the GPU node. Run whatever process you need.
To load modules and go inside the thesis virtual environment, run startup.sh
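
The resource request in the qsub step is easy to mistype, so a small helper that assembles it from named values may help. This is a sketch, not part of the repository: the function name build_qsub_cmd is hypothetical, and the defaults simply mirror the request quoted above. It only prints the command and does not submit anything.

```shell
#!/usr/bin/env bash
# Sketch: assemble the interactive GPU request from named values so they are easy to change.
# The defaults mirror the request quoted in the steps above; the function name is hypothetical.

build_qsub_cmd() {
    local ngpus="${1:-1}"           # number of GPUs
    local ncpus="${2:-8}"           # number of CPU cores
    local mem="${3:-46gb}"          # memory
    local walltime="${4:-12:00:00}" # maximum run time (hh:mm:ss)
    echo "qsub -I -l select=1:ngpus=${ngpus}:ncpus=${ncpus}:mem=${mem},walltime=${walltime}"
}

# Print the command instead of running it, so it can be checked first.
build_qsub_cmd
```

For a shorter session, build_qsub_cmd 1 8 46gb 01:00:00 prints the same request with a one hour walltime. Copy the printed line into the screen session on katana to submit it.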

Note: If there is an error message saying Permission Denied when running a script, use chmod u+x -R /path/to/directory to change the permissions of all the files in the directory so that you have permission to execute.
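
To see what the chmod fix does, here is a minimal sketch that reproduces the situation in a throwaway directory. The directory and the script hello.sh are made up for the demonstration and have nothing to do with the thesis scripts.

```shell
#!/usr/bin/env bash
# Sketch: reproduce and fix a "Permission denied" error in a throwaway directory.

demo_dir="$(mktemp -d)"   # scratch directory, not part of the thesis repo
printf '#!/bin/sh\necho ok\n' > "$demo_dir/hello.sh"

# A freshly created file is normally not executable for its owner.
[ -x "$demo_dir/hello.sh" ] || echo "not executable yet"

# Recursively add execute permission for the owner only.
chmod -R u+x "$demo_dir"

# The script can now be run directly.
"$demo_dir/hello.sh"
```

The u+x mode only changes the owner's permission, so other users on the cluster gain nothing from it.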

To install any Python packages not available on katana, use a virtual environment: https://packaging.python.org/guides/installing-using-pip-and-virtual-environments/

The virtual environment I am currently using is kaldi/egs/renee_thesis/thesis_env. To activate it, run source thesis_env/bin/activate

To list all the packages in this virtual environment, use pip list

Connecting to GitHub

To start a new git repository follow the instructions here: https://kbroman.org/github_tutorial/pages/init.html
When ready to push changes:

git add -A (adds all files)

git commit -m "Commit message here" (commits changes with a message)

git push origin master (pushes changes)
If there are Permission Denied errors, follow the instructions here: https://gist.github.com/adamjohnson/5682757
To check the status, use git status
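
The add, commit and status cycle above can be tried safely in a throwaway repository before using it on real work. This is a sketch under assumptions: the file notes.txt and the user name and email are placeholders, and the push is left as a comment because the scratch repository has no remote.

```shell
#!/usr/bin/env bash
# Sketch: the add / commit / status cycle in a throwaway repository.
# Nothing here touches the thesis repository or a remote.

repo_dir="$(mktemp -d)"
cd "$repo_dir" || exit 1

git init --quiet
# Local identity so the commit works without global git config (placeholder values).
git config user.name "Example User"
git config user.email "user@example.com"

echo "notes" > notes.txt

git add -A                                    # stage all new, changed and deleted files
git commit --quiet -m "Commit message here"   # record the staged changes
git status --short                            # prints nothing when the working tree is clean

# With a remote named origin configured, the changes would then be pushed with:
#   git push origin master
```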

Leaving katana

Assuming you are inside a screen and inside a requested GPU node:

Press Ctrl+A then D to detach from the screen session.
Type exit to log out of the katana session.

Returning to katana

Run ssh zID@katana1.restech.unsw.edu.au in a terminal, or use the alias katana.
Go back to your screen with screen -r nameOfSession, e.g. screen -r ogi

Useful katana screen things

To create a new window (tab) within a screen, use Ctrl+A C
To go to the next and previous windows, use Ctrl+A N and Ctrl+A P respectively.
To check if you are in a screen, type the command echo $TERM (the value typically starts with screen when you are inside one)
To list your screens, type the command screen -list
To detach a screen remotely, find the screen name using screen -list and then run screen -d [name of screen]
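
The check for being inside a screen can also be scripted. The sketch below relies on two things GNU screen normally does: it sets the STY variable to the session name, and it sets TERM to a value starting with screen. The function name in_screen is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch: report whether the current shell is inside a GNU screen session.
# GNU screen sets STY to the session name and usually sets TERM to a value
# starting with "screen".

in_screen() {
    if [ -n "${STY:-}" ]; then
        return 0
    fi
    case "${TERM:-}" in
        screen*) return 0 ;;
        *)       return 1 ;;
    esac
}

if in_screen; then
    echo "inside screen session: ${STY:-unknown}"
else
    echo "not inside a screen session"
fi
```

STY is the more specific signal, because tmux can also set TERM to a value starting with screen.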

Bash things

Create aliases by editing ~/.bashrc and then running source ~/.bashrc: https://linuxize.com/post/how-to-create-bash-aliases/
Current aliases: t2renee_thesis t2chacmod
Screen tabs: https://unix.stackexchange.com/questions/26248/tabs-when-using-screen/319364
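
As a minimal sketch of the alias workflow, the example below uses a scratch file in place of the real ~/.bashrc so it can be run without changing any settings. The alias name t2demo and its target are hypothetical and are not the definitions of the aliases listed above.

```shell
#!/usr/bin/env bash
# Sketch: add an alias to a startup file and load it.
# Uses a scratch file instead of the real ~/.bashrc; the alias name and target are hypothetical.

rc_file="$(mktemp)"

# Append the alias definition, as you would at the end of ~/.bashrc.
echo "alias t2demo='cd /tmp'" >> "$rc_file"

# Reload the file so the alias is defined in the current shell.
source "$rc_file"

# Show the definition that was loaded.
alias t2demo
```

In an interactive shell, typing t2demo would then change to the target directory. Scripts do not expand aliases by default, which is why the example only prints the definition.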
