Redundancy And Productivity In The Speech Technology Lexicon
- Topics:
- Efficiency
- Tags:
- Advertising & Promotion,
- Emerging Technologies,
- Lexicon,
- Marketing,
- Speech Recognition,
- University Of Edinburgh
- Source:
- University of Edinburgh
FREE Registration is required
Overview: Current lexica for speech technology typically contain much redundancy, while omitting useful information. A comparison with lexica in other media and for other purposes is instructive, as it highlights some features we may borrow for text-to-speech and speech recognition lexica. We describe some aspects of the new lexicon we are producing, Combilex, whose structure and implementation is specifically designed to reduce redundancy and improve the representation of productive elements of English. Most importantly, many English words are predictable derivations of baseforms, or compounds. Storing the lexicon as a combination of baseforms and derivational rules speeds up lexicon development, and improves coverage and maintainability.
(Is this item miscategorized? Does it need more tags? Let us know.)
Format: PDF | Size: 222KB | Date: Jun 2006 | Pages: 4



