Showing all 3 results
Peer reviewed
Ide, Nancy – Computers and the Humanities, 1995
Describes problems in devising a Text Encoding Initiative (TEI) encoding format for dictionaries. Asserts that, because of their high degree of structuring and compression of information, dictionaries are among the most complex text types treated in the TEI. Concludes that the source of some TEI problems lies in the design of Standard Generalized Markup Language (SGML). (CFR)
Descriptors: Databases, Dictionaries, Higher Education, Lexicography
Ide, Nancy – 1995
The demand for extensive reusability of large language text collections for natural language processing research requires development of standardized encoding formats. Such formats must be capable of representing different kinds of information across the spectrum of text types and languages, capable of representing different levels of…
Descriptors: Coding, Computational Linguistics, Computer Software, Descriptive Linguistics
Erjavec, Tomaz; Ide, Nancy; Petkevic, Vladimir; Veronis, Jean – 1995
MULTEXT is a European Union project to identify and develop language resources, language-related software, and standards to make the resources maximally usable. MULTEXT-EAST is a spinoff project to develop significant resources for six Central and Eastern European (CEE) languages (Bulgarian, Czech, Estonian, Hungarian, Romanian, Slovenian) and…
Descriptors: Bulgarian, Computational Linguistics, Computer Software, Czech