COMPARISON TABLE OF CORPORA FROM THE LDC CATALOG



Search criteria used on the LDC website:

Language(s): Arabic

DCMI Type(s): Sound

Application(s): speech recognition



No. | LDC Catalog No. | Item Name | Author(s) | Release Date | Member Year(s) | DCMI Type(s) | Sample Type | Sample Rate (Hz) | Data Source(s) | Application(s) | Language(s)
1 | LDC2002S02 | West Point Arabic Speech | Stephen LaRocca, Rajaa Chouairi | August 20, 2002 | 2002 | Sound | 1-channel pcm | 22050 | microphone speech | speech recognition | Arabic
2 | LDC2012S01 | 2006 NIST Speaker Recognition Evaluation Test Set Part 2 | NIST Multimodal Information Group | January 19, 2012 | 2012 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Urdu, Thai, Spanish, Russian, Korean, Hindi, Persian, English, Mandarin Chinese, Bengali, Standard Arabic, Dari, Iranian Persian, Chinese, Arabic
3 | LDC2011S05 | 2008 NIST Speaker Recognition Evaluation Training Set Part 1 | NIST Multimodal Information Group | August 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Tigrinya, Thai, Tagalog, Spanish, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Central Khmer, Georgian, Japanese, Italian, Hindi, Persian, English, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Northern Khmer, Dari, Iranian Persian, Chinese, Arabic
4 | LDC2011S01 | 2005 NIST Speaker Recognition Evaluation Training Data | NIST Multimodal Information Group | May 24, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Spanish, Russian, English, Mandarin Chinese, Arabic
5 | LDC2011S04 | 2005 NIST Speaker Recognition Evaluation Test Data | NIST Multimodal Information Group | July 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Spanish, Russian, English, Mandarin Chinese, Arabic
6 | LDC2011S07 | 2008 NIST Speaker Recognition Evaluation Training Set Part 2 | NIST Multimodal Information Group | September 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Tigrinya, Thai, Tagalog, Spanish, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Central Khmer, Georgian, Japanese, Italian, Hindi, Persian, English, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Northern Khmer, Dari, Iranian Persian, Chinese, Arabic
7 | LDC2011S08 | 2008 NIST Speaker Recognition Evaluation Test Set | NIST Multimodal Information Group | October 21, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Thai, Tagalog, Tamil, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Japanese, Italian, Hindi, Persian, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Dari, Iranian Persian, English, Chinese, Arabic
8 | LDC2011S09 | 2006 NIST Speaker Recognition Evaluation Training Set | NIST Multimodal Information Group | November 16, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Yue Chinese, Urdu, Thai, Russian, Korean, Hindi, English, Mandarin Chinese, Bengali, Standard Arabic, Chinese, Arabic
9 | LDC2011S10 | 2006 NIST Speaker Recognition Evaluation Test Set Part 1 | NIST Multimodal Information Group | December 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Urdu, Thai, Spanish, Russian, Korean, Hindi, Persian, English, Mandarin Chinese, Bengali, Standard Arabic, Dari, Iranian Persian, Chinese, Arabic
10 | LDC2014S02 | King Saud University Arabic Speech Database | Mansour Alsulaiman, Ghulam Muhammad, Bencherif Abdelkader, Awais Mahmood, Zulfiqar Ali | February 17, 2014 | 2014 | Sound | pcm | 48000 | microphone speech | speech recognition, speaker identification | Arabic
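
The search criteria listed at the top appear to match any corpus whose Language(s) field contains Arabic, which would explain why multilingual NIST sets appear next to the two Arabic-only corpora. The sketch below illustrates that reading on three rows copied from the table. The Corpus class, the matches_query and arabic_only helpers, and the membership-based matching rule are all assumptions made for illustration; they are not part of any LDC tool or API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Corpus:
    """Hypothetical record holding a few columns of one table row."""
    catalog_no: str
    name: str
    dcmi_type: str
    sample_rate_hz: int
    applications: Tuple[str, ...]
    languages: Tuple[str, ...]


# Three rows copied from the table (entries 1, 4 and 10).
CORPORA = [
    Corpus("LDC2002S02", "West Point Arabic Speech", "Sound", 22050,
           ("speech recognition",), ("Arabic",)),
    Corpus("LDC2011S01",
           "2005 NIST Speaker Recognition Evaluation Training Data",
           "Sound", 8000, ("speech recognition",),
           ("Spanish", "Russian", "English", "Mandarin Chinese", "Arabic")),
    Corpus("LDC2014S02", "King Saud University Arabic Speech Database",
           "Sound", 48000,
           ("speech recognition", "speaker identification"), ("Arabic",)),
]


def matches_query(corpus: Corpus,
                  language: str = "Arabic",
                  dcmi_type: str = "Sound",
                  application: str = "speech recognition") -> bool:
    """Assumed matching rule: each criterion only has to be present."""
    return (language in corpus.languages
            and corpus.dcmi_type == dcmi_type
            and application in corpus.applications)


def arabic_only(corpora: List[Corpus]) -> List[Corpus]:
    """Keep corpora whose only listed language is Arabic."""
    return [c for c in corpora if c.languages == ("Arabic",)]


selected = [c for c in CORPORA if matches_query(c)]
for c in arabic_only(selected):
    print(c.catalog_no, c.name, c.sample_rate_hz)
```

Under this assumed rule all three sample rows satisfy the query, while the stricter arabic_only filter keeps only LDC2002S02 and LDC2014S02.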