Listing 1 - 5 of 5 |
Sort by
|
Choose an application
The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.
Cluster analysis -- Data processing. --- Corpora (Linguistics) -- Data processing. --- Natural language processing (Computer science). --- Quantitative linguistics. --- Corpora (Linguistics) --- Cluster analysis --- Natural language processing (Computer science) --- Quantitative linguistics --- Computational linguistics --- Languages & Literatures --- Philology & Linguistics --- Data processing --- Cluster-Analyse. --- Korpus (Linguistik) --- Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation --- (VLB-WN)1561: Hardcover, Softcover / Allgemeine und Vergleichende Sprachwissenschaft --- Korpus (Linguistik). --- Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation. --- (VLB-WN)1561: Hardcover, Softcover / Allgemeine und Vergleichende Sprachwissenschaft. --- Computational linguistics. --- Data processing. --- Automatic language processing --- Language and languages --- Language data processing --- Linguistics --- Natural language processing (Linguistics) --- Applied linguistics --- Cross-language information retrieval --- Mathematical linguistics --- Multilingual computing --- NLP (Computer science) --- Artificial intelligence --- Electronic data processing --- Human-computer interaction --- Semantic computing --- Corpus-based analysis (Linguistics) --- Corpus linguistics --- Linguistic analysis (Linguistics) --- Corpus linguistics. --- cluster analysis. --- hypothesis generation. --- quantitative linguistics.
Choose an application
Quantitative Linguistics is a rapidly developing discipline covering more and more areas of linguistic and textological research. The book represents an overview of the state of the art in Quantitative Linguistics, its scope and reach. Some of the topics: linguistic laws, frequency analyses, synergetic models of language, networks, part-of-speech systems, authorship attribution, polyfunctionality and polysemy, and opinion target identification.
Linguistics --- Mathematical linguistics. --- Statistical methods. --- Algebraic linguistics --- Language and languages --- Linguistics, Mathematical --- Linguistics, Statistical --- Statistical linguistics --- Statistical methods --- Mathematical models --- Applied linguistics --- Information theory --- Computational linguistics --- Mathematical linguistics --- quantitative linguistics, corpus studies, computational methods, 2014 conference, Qualico, Olomouc.
Choose an application
The book presents methods for the objective analysis of poetic language. Common objects of literary studies such as rhythm, semantic explications, interpretation and personal impressions are avoided. Only those properties of poetic texts are taken into account that could be quantified. The major chapters contain the analysis of phonic phenomena (frequency, euphony, assonance, alliteration, aggregation, rhyme), word properties (aspects of frequency, length, richness, word classes, sequences of word properties, characterisations). The synergetic control cycle is the result of the study of mutual links between properties. For all methods both statistical tests (evaluation, comparison), theoretical derivations (models), and examples are presented. The book is dedicated to the work of the famous Romanian poet Mihai Eminescu whose complete work was analysed, which made detailed illustrations of the method possible. The methods can be used mutatis mutandis for any language and text. It is the first comprehensive quantitative analysis of a poetic work.
Romanian literature --- Balkan literature --- Statistical methods. --- Data processing. --- Eminescu, Mihai, --- Eminescu, Mihai --- Eminescu, Mihail --- Eminesco, Michel --- Eminovici, Mihail --- Eminovicz, Michael --- Technique. --- quantitative linguistics. --- quantitative text analysis, poetics, statistical methodology.
Choose an application
The edited volume Sequences in Language and Text is the first collection of original research in the area of the quantitative analysis of sequentially organized linguistic data. Linguistic sequences are extremely useful textual structures in almost all areas of Language Technology. Character and word n-grams are by far the most successful features in text classification tasks such as authorship identification, text categorization, genre classification, sentiment analysis etc. Furthermore character linguistic sequences are the basis for linguistic modeling and subsequent applications such as speech recognition, language identification etc. In addition to the above language technology oriented research, the present volume aims to give insight to the theoretical value of linguistic sequences. Sequences in texts can be produced by a number of different factors, either external to the linguistic system or by its own grammatical structure. This volume hosts contributions which will analyze linguistic sequences using quantitative methods under the synergetic theoretical framework that can explain their role in the linguistic system.
Computational linguistics. --- Computational linguistics--Research. --- Computational linguistics --- Languages & Literatures --- Philology & Linguistics --- Research --- Research. --- Automatic language processing --- Language and languages --- Language data processing --- Linguistics --- Natural language processing (Linguistics) --- Data processing --- Applied linguistics --- Cross-language information retrieval --- Mathematical linguistics --- Multilingual computing --- Quantitative Linguistics, Sequence Analysis, Mathematical Linguistics.
Choose an application
The present volume presents objective methods to detect and analyse various forms of repetitions. Repetition of textual elements is more than a superficial phenomenon. It may even be considered as constitutive for units and relations in a text: on a primary level when no other way exists to establish a unit – as in a musical composition (a motif can be recognised as such only after at least one repetition) – and on a secondary, artistic level, where repetition is a consequence of the transfer of the equivalence principle from the paradigmatic axis to the syntagmatic one as showed by R. Jakobson. The analysis of repetitive elements and structures in texts with objective mathematical means can serve several practical and theoretical purposes, among them: Characterisation of texts by means of parameters (measures, indicators) as taken from established mathematical statistics or specifically constructed ones in individual cases. Comparison of texts on the basis of their quantitative characteristics and classification of the texts by the results. Research for the laws of text, which control the mechanisms connected to text creation. As a remote aim, the construction of a theory of text consisting of a system of text laws. The final attempt of every possible quantitative text analysis is the construction of a text theory. The book illustrates this on examples of such laws and corresponding empirical tests.
Linguistics -- Methodology. --- Linguistics -- Statistical methods. --- Repetition (Rhetoric). --- Writing -- Mathematical models. --- Writing --- Repetition (Rhetoric) --- Linguistics --- Languages & Literatures --- Philology & Linguistics --- Linguistic science --- Science of language --- Language and languages --- Rhetoric --- Literary style --- Chirography --- Handwriting --- Ciphers --- Penmanship --- Mathematical models --- Statistical methods --- Methodology --- Mathematical models. --- Statistical methods. --- Methodology. --- Linguistics, Statistical --- Statistical linguistics --- Mathematical linguistics --- Statistique linguistique. --- Linguistique --- Méthodologie. --- quantitative linguistics, repetition analysis. --- quantitative text analysis, statistical methodology.
Listing 1 - 5 of 5 |
Sort by
|