VoxForge Dataset

Dataset Description

VoxForge is a collection of voice recordings from various languages. The set that we use in bob.bio.spear is a part of the English VoxForge corpus. It contains:

Identities

Sample count

train

10

3148

dev

references

10

1304

probes

300

eval

references

10

1509

probes

300

GMM

To run the baseline, use the following command:

bob bio pipeline simple -d voxforge -p gmm-default -g dev -g eval -l sge -o results/gmm_voxforge

Then, to generate the scores, use:

bob bio metrics -e ./results/gmm_voxforge/scores-{dev,eval}.csv
Table 1 [Min. criterion: EER] Threshold on Development set: 2.128360e+00

Development

Evaluation

Failure to Acquire

0.0%

0.0%

False Match Rate

2.0% (54/2700)

1.8% (48/2700)

False Non Match Rate

2.0% (6/300)

1.3% (4/300)

False Accept Rate

2.0%

1.8%

False Reject Rate

2.0%

1.3%

Half Total Error Rate

2.0%

1.6%

On 128[1] CPU nodes on the SGE Grid: Ran in 13 minutes (5 minutes of training).

ISV

To run the baseline, use the following command:

bob bio pipeline simple -d voxforge -p isv-default -g dev -g eval -l sge -o results/isv_voxforge

Then, to generate the scores, use:

bob bio metrics -e ./results/isv_voxforge/scores-{dev,eval}.csv
Table 2 [Min. criterion: EER] Threshold on Development: 1.680925e+00

Development

Evaluation

Failure to Acquire

0.0%

0.0%

False Match Rate

1.3% (36/2700)

0.7% (20/2700)

False Non Match Rate

1.3% (4/300)

2.7% (8/300)

False Accept Rate

1.3%

0.7%

False Reject Rate

1.3%

2.7%

Half Total Error Rate

1.3%

1.7%

On 128[1] CPU nodes on the SGE Grid: Ran in 13 minutes (7 minutes of training).

I-Vector

To run the baseline, use the following command:

bob bio pipeline simple -d voxforge -p ivector-default -g dev -g eval -l sge -o results/ivector_voxforge

Then, to generate the scores, use:

bob bio metrics -e ./results/ivector_voxforge/scores-{dev,eval}.csv
Table 3 [Min. criterion: EER ] Threshold on Development set: -7.924394e-01

Development

Evaluation

Failure to Acquire

0.0%

0.0%

False Match Rate

4.3% (116/2700)

6.9% (186/2700)

False Non Match Rate

4.3% (13/300)

4.3% (13/300)

False Accept Rate

4.3%

6.9%

False Reject Rate

4.3%

4.3%

Half Total Error Rate

4.3%

5.6%

I-Vector PLDA

To run the baseline, use the following command:

bob bio pipeline simple -d voxforge -p ivector-plda -g dev -g eval -l sge -o results/ivector_plda_voxforge

Then, to generate the scores, use:

bob bio metrics -e ./results/ivector_plda_voxforge/scores-{dev,eval}.csv

Speechbrain ECAPA-TDNN

To run the baseline, use the following command:

bob bio pipeline simple -d voxforge -p speechbrain-ecapa-voxceleb -g dev -g eval -l sge -o results/speechbrain_voxforge

Then, to generate the scores, use:

bob bio metrics -e ./results/speechbrain_voxforge/scores-{dev,eval}.csv
Table 4 [Min. criterion: EER] Threshold on Development set: -6.159925e-01

Development

Evaluation

Failure to Acquire

0.0%

0.0%

False Match Rate

0.0% (0/2700)

0.8% (21/2700)

False Non Match Rate

0.0% (0/300)

0.0% (0/300)

False Accept Rate

0.0%

0.8%

False Reject Rate

0.0%

0.0%

Half Total Error Rate

0.0%

0.4%

On 128[1] CPU nodes on the SGE Grid: Ran in 9 minutes (no training).

Footnotes