Narrow your search

Library

UGent (5)

KU Leuven (4)

LUCA School of Arts (4)

Odisee (4)

Thomas More Kempen (4)

Thomas More Mechelen (4)

UCLL (4)

VIVES (4)

VUB (4)

ULiège (1)


Resource type

book (5)


Language

English (5)


Year
From To Submit

2015 (5)

Listing 1 - 5 of 5
Sort by

Book
Cluster analysis for corpus linguistics
Author:
ISBN: 3110363828 3110350254 9783110363821 9783110350258 9783110363814 311036381X 9783110393170 3110393174 Year: 2015 Publisher: Berlin Boston

Loading...
Export citation

Choose an application

Bookmark

Abstract

The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is such a method. It is used across the sciences for hypothesis generation by identification of structure in data which are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

Keywords

Cluster analysis -- Data processing. --- Corpora (Linguistics) -- Data processing. --- Natural language processing (Computer science). --- Quantitative linguistics. --- Corpora (Linguistics) --- Cluster analysis --- Natural language processing (Computer science) --- Quantitative linguistics --- Computational linguistics --- Languages & Literatures --- Philology & Linguistics --- Data processing --- Cluster-Analyse. --- Korpus (Linguistik) --- Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation --- (VLB-WN)1561: Hardcover, Softcover / Allgemeine und Vergleichende Sprachwissenschaft --- Korpus (Linguistik). --- Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation. --- (VLB-WN)1561: Hardcover, Softcover / Allgemeine und Vergleichende Sprachwissenschaft. --- Computational linguistics. --- Data processing. --- Automatic language processing --- Language and languages --- Language data processing --- Linguistics --- Natural language processing (Linguistics) --- Applied linguistics --- Cross-language information retrieval --- Mathematical linguistics --- Multilingual computing --- NLP (Computer science) --- Artificial intelligence --- Electronic data processing --- Human-computer interaction --- Semantic computing --- Corpus-based analysis (Linguistics) --- Corpus linguistics --- Linguistic analysis (Linguistics) --- Corpus linguistics. --- cluster analysis. --- hypothesis generation. --- quantitative linguistics.


Book
Recent Contributions to Quantitative Linguistics
Authors: --- --- --- --- --- et al.
ISBN: 3110420295 311042035X 9783110419870 3110419874 9783110420296 9783110420357 Year: 2015 Publisher: Berlin Boston

Loading...
Export citation

Choose an application

Bookmark

Abstract

Quantitative Linguistics is a rapidly developing discipline covering more and more areas of linguistic and textological research. The book represents an overview of the state of the art in Quantitative Linguistics, its scope and reach. Some of the topics: linguistic laws, frequency analyses, synergetic models of language, networks, part-of-speech systems, authorship attribution, polyfunctionality and polysemy, and opinion target identification.


Book
Quantitative Analysis of Poetic Texts
Authors: --- --- ---
ISBN: 3110394790 3110363798 3110336057 Year: 2015 Publisher: De Gruyter

Loading...
Export citation

Choose an application

Bookmark

Abstract

The book presents methods for the objective analysis of poetic language. Common objects of literary studies such as rhythm, semantic explications, interpretation and personal impressions are avoided. Only those properties of poetic texts are taken into account that could be quantified. The major chapters contain the analysis of phonic phenomena (frequency, euphony, assonance, alliteration, aggregation, rhyme), word properties (aspects of frequency, length, richness, word classes, sequences of word properties, characterisations). The synergetic control cycle is the result of the study of mutual links between properties. For all methods both statistical tests (evaluation, comparison), theoretical derivations (models), and examples are presented. The book is dedicated to the work of the famous Romanian poet Mihai Eminescu whose complete work was analysed, which made detailed illustrations of the method possible. The methods can be used mutatis mutandis for any language and text. It is the first comprehensive quantitative analysis of a poetic work.


Book
Sequences in language and text
Authors: ---
ISBN: 3110394774 3110362872 3110362732 9783110362879 9783110394771 9783110362732 Year: 2015 Publisher: Berlin Boston

Loading...
Export citation

Choose an application

Bookmark

Abstract

The edited volume Sequences in Language and Text is the first collection of original research in the area of the quantitative analysis of sequentially organized linguistic data. Linguistic sequences are extremely useful textual structures in almost all areas of Language Technology. Character and word n-grams are by far the most successful features in text classification tasks such as authorship identification, text categorization, genre classification, sentiment analysis etc. Furthermore character linguistic sequences are the basis for linguistic modeling and subsequent applications such as speech recognition, language identification etc. In addition to the above language technology oriented research, the present volume aims to give insight to the theoretical value of linguistic sequences. Sequences in texts can be produced by a number of different factors, either external to the linguistic system or by its own grammatical structure. This volume hosts contributions which will analyze linguistic sequences using quantitative methods under the synergetic theoretical framework that can explain their role in the linguistic system.


Book
Forms and degrees of repetition in texts : detection and analysis
Authors: ---
ISBN: 3110412020 9783110411959 3110411954 9783110411942 3110411946 9783110411942 9783110412024 9783110411799 3110411792 Year: 2015 Publisher: Berlin: De Gruyter Mouton,

Loading...
Export citation

Choose an application

Bookmark

Abstract

The present volume presents objective methods to detect and analyse various forms of repetitions. Repetition of textual elements is more than a superficial phenomenon. It may even be considered as constitutive for units and relations in a text: on a primary level when no other way exists to establish a unit – as in a musical composition (a motif can be recognised as such only after at least one repetition) – and on a secondary, artistic level, where repetition is a consequence of the transfer of the equivalence principle from the paradigmatic axis to the syntagmatic one as showed by R. Jakobson. The analysis of repetitive elements and structures in texts with objective mathematical means can serve several practical and theoretical purposes, among them: Characterisation of texts by means of parameters (measures, indicators) as taken from established mathematical statistics or specifically constructed ones in individual cases. Comparison of texts on the basis of their quantitative characteristics and classification of the texts by the results. Research for the laws of text, which control the mechanisms connected to text creation. As a remote aim, the construction of a theory of text consisting of a system of text laws. The final attempt of every possible quantitative text analysis is the construction of a text theory. The book illustrates this on examples of such laws and corresponding empirical tests.

Listing 1 - 5 of 5
Sort by