TY - BOOK ID - 14220806 TI - XML and Web Technologies for Data Sciences with R AU - Nolan, Deborah. AU - Temple Lang, Duncan. PY - 2014 SN - 1461478995 1461479002 PB - New York, NY : Springer New York : Imprint: Springer, DB - UniCat KW - XML (Document markup language) KW - Computer science. KW - Informatics KW - Extendible Markup Language (Document markup language) KW - eXtensible Markup Language (Document markup language) KW - Mathematical statistics. KW - Statistics. KW - Statistics and Computing/Statistics Programs. KW - Programming Languages, Compilers, Interpreters. KW - Statistics, general. KW - Science KW - Statistical analysis KW - Statistical data KW - Statistical methods KW - Statistical science KW - Mathematics KW - Econometrics KW - Statistical inference KW - Statistics, Mathematical KW - Statistics KW - Probabilities KW - Sampling (Statistics) KW - Document markup languages KW - Statistics . KW - Programming languages (Electronic computers). KW - Computer languages KW - Computer program languages KW - Computer programming languages KW - Machine language KW - Electronic data processing KW - Languages, Artificial KW - R (Computer program language). KW - GNU-S (Computer program language) KW - Domain-specific programming languages KW - Programming languages (Electronic computers) UR - https://www.unicat.be/uniCat?func=search&query=sysid:14220806 AB - Web technologies are increasingly relevant to scientists working with data, for both accessing data and creating rich dynamic and interactive displays. The XML and JSON data formats are widely used in Web services, regular Web pages and JavaScript code, and visualization formats such as SVG and KML for Google Earth and Google Maps. In addition, scientists use HTTP and other network protocols to scrape data from Web pages, access REST and SOAP Web Services, and interact with NoSQL databases and text search applications. This book provides a practical hands-on introduction to these technologies, including high-level functions the authors have developed for data scientists. It describes strategies and approaches for extracting data from HTML, XML, and JSON formats and how to programmatically access data from the Web. Along with these general skills, the authors illustrate several applications that are relevant to data scientists, such as reading and writing spreadsheet documents both locally and via GoogleDocs, creating interactive and dynamic visualizations, displaying spatial-temporal displays with Google Earth, and generating code from descriptions of data structures to read and write data. These topics demonstrate the rich possibilities and opportunities to do new things with these modern technologies. The book contains many examples and case-studies that readers can use directly and adapt to their own work. The authors have focused on the integration of these technologies with the R statistical computing environment. However, the ideas and skills presented here are more general, and statisticians who use other computing environments will also find them relevant to their work. Deborah Nolan is Professor of Statistics at University of California, Berkeley. Duncan Temple Lang is Associate Professor of Statistics at University of California, Davis and has been a member of both the S and R development teams. ER -