Listing 1 - 10 of 18 | << page >> |
Psycholinguistics --- Mathematical linguistics --- Linguistics
Lexicology. Semantics --- Mathematical linguistics --- Grammar
Lexicology. Semantics --- Mathematical linguistics --- Theses
Computer-assisted corpus linguistics is one of the main points of convergence between linguistic and computational methods. In particular, the use of diachronic linguistic corpora provides opportunities for the quantitative analysis of phenomena concerning language change through time. This dissertation offers contributions to three stages of research involving diachronic corpora: (a) corpus building and compilation; (b) design of tools and algorithms for data exploration; and (c) data analysis for linguistic, cultural and historical research. Two resources are first presented: a Web scraper of comments from news portals; and a diachronic corpus composed of comments published in a major Brazilian news website. These resources are relevant not only for linguists, but also for professionals concerned with the public perception of news and the relationship between media and society. Then, we propose a generalizable method to assist the identification of periods of establishment and obsolescence of linguistic items in a diachronic corpus based on the frequency of these items in the corpus. This method may be employed for the analysis of any collection of linguistic items, regardless of language or historical period. Finally, we describe how diachronic corpora might be used for quantitative linguistic investigation by proposing a framework centered on the investigation of vocabulary through a diachronic approach. The applicability of this framework is demonstrated through a case analysis of the use of the term "fake news" in the media. With these contributions, we expect to advance research on diachronic corpus linguistics and on computational methods for linguistic analysis.
Mathematical linguistics --- Theses --- E-books
This thesis presents a novel approach to the processing and representation of natural language syntax and semantics, combining symbolic and neural techniques. The symbolic core is powered by a linear type system that uses modalities to capture dependency structures on top of function-argument relations, enabling a more flexible and expressive way of representing grammatical utterances. The practical applications of this approach are showcased through the computational study of Dutch, utilizing a set of tools and resources developed specifically for this purpose. These include a large proofbank, i.e., a collection of sentences associated with tectogrammatic theorems and their corresponding programs, supported by an extensive type lexicon, which provides type assignments to almost one million lexical tokens within a given linguistic context. Parsing is handled by a combination of static type-checking, a state-of-the-art supertagger based on heterogeneous graph convolutions, and a massively parallel proof search component formulated as a neural bijection learner. Overall, this thesis demonstrates the power of an integrated neurosymbolic approach to natural language processing, combining the best of both worlds: the symbolic representation of meaning and the statistical power of modern neural networks.
Lexicology. Semantics --- Grammar --- Mathematical linguistics