Login / Signup

TuLeD (Tupían lexical database): introducing a database of a South American language family.

Fabrício Ferraz GerardiStanislav ReichertCarolina Coelho Aragon
Published in: Language resources and evaluation (2021)
The last two decades witnessed a rapid growth of publicly accessible online language resources. This has allowed for valuable data on lesser known languages to become available. Such resources provide linguists with opportunities for advancing their research. Yet despite the proliferation of lexical and morphological databases, the ca. 456 languages spoken in South America are poorly represented, particularly the Tupían family, which is the largest on the continent. This paper therefore introduces and discusses TuLeD, a lexical database exclusively devoted to a South American language family. It provides a comprehensive list of lexical items presented in a unified transcription for all languages with cognacy assignment and relevant (cultural or linguistic) notes. One of the main goals of TuLeD is to become a full-fledged database and a benchmark for linguistic studies on South American languages in general and the Tupían family in particular.
Keyphrases
  • autism spectrum disorder
  • adverse drug
  • big data
  • signaling pathway
  • emergency department
  • electronic health record
  • transcription factor
  • public health
  • machine learning
  • global health
  • deep learning