UniCat-Search

Union Catalogue of Belgian Libraries

English | Nederlands | Français

Feedback

About UniCat

Help

News

Narrow your search

Library

KBC (2)

Resource type

book (2)

Language

English (2)

Year

From To

2018 (1)

2017 (1)

Listing 1 - 2 of 2
Sort by

Book

Query processing on probabilistic data : a survey
Authors: Van den Broeck, Guy --- Suciu, Dan
ISBN: 1680833154 1680833146 Year: 2017 Publisher: [Hanover, Massachusetts] : [Piscataqay, New Jersey] : Now Publishers, IEEE Xplore,

Abstract | Keywords | Export | Availability | Bookmark

Loading...

Export citation
Choose an application

Reference Manager

EndNote

RefWorks (Direct export to RefWorks)

Bookmark

Abstract
Probabilistic data is motivated by the need to model uncertainty in large databases. Over the last twenty years or so, both the Database community and the AI community have studied various aspects of probabilistic relational data. This survey presents the main approaches developed in the literature, reconciling concepts developed in parallel by the two research communities. The survey starts with an extensive discussion of the main probabilistic data models and their relationships, followed by a brief overview of model counting and its relationship to probabilistic data. After that, the survey discusses lifted probabilistic inference, which are a suite of techniques developed in parallel by the Database and AI communities for probabilistic query evaluation. Then, it gives a short summary of query compilation, presenting some theoretical results highlighting limitations of various query evaluation techniques on probabilistic data. The survey ends with a very brief discussion of some popular probabilistic data sets, systems, and applications that build on this technology.

Keywords
Querying (Computer science)

Book

Algorithmic aspects of parallel data processing
Authors: Koutris, Paraschos --- Salihoglu, Semih --- Suciu, Dan
ISBN: 168083407X 1680834061 Year: 2018 Publisher: [Boston, Massachusetts] : [Piscataqay, New Jersey] : Now Publishers, IEEE Xplore,

Abstract | Keywords | Export | Availability | Bookmark

Loading...

Export citation
Choose an application

Reference Manager

EndNote

RefWorks (Direct export to RefWorks)

Bookmark

Abstract
The last decade has seen a huge and growing interest in processing large data sets on large distributed clusters. This trend began with the MapReduce framework, and has been widely adopted by several other systems, including PigLatin, Hive, Scope, Dremmel, Spark and Myria to name a few. While the applications of such systems are diverse (for example, machine learning, data analytics), most involve relatively standard data processing tasks like identifying relevant data, cleaning, filtering, joining, grouping, transforming, extracting features, and evaluating results. This has generated great interest in the study of algorithms for data processing on large distributed clusters. Algorithmic Aspects of Parallel Data Processing discusses recent algorithmic developments for distributed data processing. It uses a theoretical model of parallel processing called the Massively Parallel Computation (MPC) model, which is a simplification of the BSP model where the only cost is given by the amount of communication and the number of communication rounds. The survey studies several algorithms for multi-join queries, sorting, and matrix multiplication. It discusses their relationships and common techniques applied across the different data processing tasks.

Keywords
Parallel processing (Electronic computers)

Listing 1 - 2 of 2
Sort by