This is the first repository that we have created by using VoX 0.0.1 with some modifications. The modifications include removing square roots in clustering, and using floats throughout instead of doubles. We are still using long integers in the codefile representation. Moving to short integers will reduce the bandwidth requirements by a factor of 2.
Training File Parameters
Training file size | Sampling rate | Sample size | Number of Channels | Compression type used |
~ 15 Minutes | 8000 Hz | 16 bits | 1 | PCM |
Repository Parameters
Repository Number | Number of Clusters | Frame length | MFCC features used | Number of Iterations | Size of repository obtained | Time required to generate repository |
1 | 10000 | 20 milliseconds | 0,1,2,3,4,5,6,7,8,9,10,11 | 6 | ~ 14 MB | ~ 486 minutes |
2 | 13000 | 20 milliseconds | 0,1,2,3,4,5,6,7,8,9,10,11 | 6 | ~ 14 MB | ~ 636 minutes |
Message Parameters
Using Repository | Where is the message from | Length of message file | Length of coded file | PESQ |
1 | out repository | ~ 1.9 MB | ? | 0.331 |
1 | in repository | ~ 250 KB | ? | 0.887 |
2 | in repository | ~ 250 KB | ? | 0.636 |