Language(s): Arabic
DCMI Type(s): Sound
Application(s): speech recognition
No. | LDC Catalog No. | Item Name | Author(s) | Release Date | Member Year(s) | DCMI Type(s) | Sample Type | Sample Rate | Data Source(s) | Application(s) | Language(s) |
---|---|---|---|---|---|---|---|---|---|---|---|
1 | LDC2002S02 | West Point Arabic Speech | Stephen LaRocca, Rajaa Chouairi | August 20, 2002 | 2002 | Sound | 1-channel pcm | 22050 | microphone speech | speech recognition | Arabic |
2 | LDC2012S01 | 2006 NIST Speaker Recognition Evaluation Test Set Part 2 | NIST Multimodal Information Group | January 19, 2012 | 2012 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Urdu, Thai, Spanish, Russian, Korean, Hindi, Persian, English, Mandarin Chinese, Bengali, Standard Arabic, Dari, Iranian Persian, Chinese, Arabic |
3 | LDC2011S05 | 2008 NIST Speaker Recognition Evaluation Training Set Part 1 | NIST Multimodal Information Group | August 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Tigrinya, Thai, Tagalog, Spanish, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Central Khmer, Georgian, Japanese, Italian, Hindi, Persian, English, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Northern Khmer, Dari, Iranian Persian, Chinese, Arabic |
4 | LDC2011S01 | 2005 NIST Speaker Recognition Evaluation Training Data | NIST Multimodal Information Group | May 24, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Spanish, Russian, English, Mandarin Chinese, Arabic |
5 | LDC2011S04 | 2005 NIST Speaker Recognition Evaluation Test Data | NIST Multimodal Information Group | July 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Spanish, Russian, English, Mandarin Chinese, Arabic |
6 | LDC2011S07 | 2008 NIST Speaker Recognition Evaluation Training Set Part 2 | NIST Multimodal Information Group | September 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Tigrinya, Thai, Tagalog, Spanish, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Central Khmer, Georgian, Japanese, Italian, Hindi, Persian, English, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Northern Khmer, Dari, Iranian Persian, Chinese, Arabic |
7 | LDC2011S08 | 2008 NIST Speaker Recognition Evaluation Test Set | NIST Multimodal Information Group | October 21, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Wu Chinese, Vietnamese, Uzbek, Urdu, Thai, Tagalog, Tamil, Russian, Panjabi, Min Nan Chinese, Lao, Korean, Japanese, Italian, Hindi, Persian, Mandarin Chinese, Bengali, Egyptian Arabic, Moroccan Arabic, Dari, Iranian Persian, English, Chinese, Arabic |
8 | LDC2011S09 | 2006 NIST Speaker Recognition Evaluation Training Set | NIST Multimodal Information Group | November 16, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech | speech recognition | Yue Chinese, Urdu, Thai, Russian, Korean, Hindi, English, Mandarin Chinese, Bengali, Standard Arabic, Chinese, Arabic |
9 | LDC2011S10 | 2006 NIST Speaker Recognition Evaluation Test Set Part 1 | NIST Multimodal Information Group | December 15, 2011 | 2011 | Sound | ulaw | 8000 | telephone speech, microphone speech | speech recognition | Yue Chinese, Urdu, Thai, Spanish, Russian, Korean, Hindi, Persian, English, Mandarin Chinese, Bengali, Standard Arabic, Dari, Iranian Persian, Chinese, Arabic |
10 | LDC2014S02 | King Saud University Arabic Speech Database | Mansour Alsulaiman, Ghulam Muhammad, Bencherif Abdelkader, Awais Mahmood, Zulfiqar Ali | February 17, 2014 | 2014 | Sound | pcm | 48000 | microphone speech | speech recognition, speaker identification | Arabic |