Book
Cluster analysis for corpus linguistics
Author:
ISBN: 3110363828 3110350254 9783110363821 9783110350258 9783110363814 311036381X 9783110393170 3110393174
Year: 2015
Publisher: Berlin ; Boston : De Gruyter

Abstract

The standard scientific methodology in linguistics is empirical testing of falsifiable hypotheses. As such, the process of hypothesis generation is central, and involves formulation of a research question about a domain of interest and statement of a hypothesis relative to it. In corpus linguistics, the domain is text, and generation involves abstraction of data from text, data analysis, and formulation of a hypothesis based on inference from the results. Traditionally this process has been paper-based, but the advent of electronic text has increasingly rendered it obsolete, both because the size of digital corpora is now at or beyond the limit of what can efficiently be used in the traditional way, and because the complexity of data abstracted from them can be impenetrable to understanding. Linguists are increasingly turning to mathematical and statistical computational methods for help, and cluster analysis is one such method. It is used across the sciences for hypothesis generation by identifying structure in data that are too large or complex, or both, to be interpretable by direct inspection. This book aims to show how cluster analysis can be used for hypothesis generation in corpus linguistics, thereby contributing to a quantitative empirical methodology for the discipline.

Keywords

Cluster analysis -- Data processing --- Corpora (Linguistics) -- Data processing --- Natural language processing (Computer science) --- Quantitative linguistics --- Corpora (Linguistics) --- Cluster analysis --- Computational linguistics --- Languages & Literatures --- Philology & Linguistics --- Data processing --- Cluster-Analyse --- Korpus (Linguistik) --- Corpus linguistics; cluster analysis; quantitative linguistics; hypothesis generation --- (VLB-WN)1561: Hardcover, Softcover / Allgemeine und Vergleichende Sprachwissenschaft --- Automatic language processing --- Language and languages --- Language data processing --- Linguistics --- Natural language processing (Linguistics) --- Applied linguistics --- Cross-language information retrieval --- Mathematical linguistics --- Multilingual computing --- NLP (Computer science) --- Artificial intelligence --- Electronic data processing --- Human-computer interaction --- Semantic computing --- Corpus-based analysis (Linguistics) --- Corpus linguistics --- Linguistic analysis (Linguistics) --- Hypothesis generation
