Listing 1 - 10 of 2443 | << page >> |
Sort by
|
Choose an application
Unlock the full potential of Azure Synapse Analytics with Ultimate Azure Synapse Analytics, your definitive roadmap to mastering the art of data analytics in the cloud era. From the foundational concepts to advanced techniques, each chapter offers practical insights and hands-on tutorials to streamline your data workflows and drive actionable insights. Discover how Azure Synapse Analytics revolutionizes data processing and integration, empowering you to harness the vast capabilities of the Azure ecosystem. Seamlessly transition from traditional data warehousing to cutting-edge big data analytics, leveraging serverless and dedicated resources for optimal performance. Dive deep into Synapse SQL, explore advanced data engineering with Apache Spark, and delve into machine learning and DevOps practices to stay ahead in today's data-driven landscape. Whether you're seeking to optimize performance, ensure compliance, or facilitate seamless migration, this book provides the expertise needed to excel in your role. Gain valuable insights into industry best practices, enhance your data engineering skills, and drive innovation within your organization.
Choose an application
Apache Arrow is an open source, columnar in-memory data format designed for efficient data processing and analytics. This book harnesses the author’s 15 years of experience to show you a standardized way to work with tabular data across various programming languages and environments, enabling high-performance data processing and exchange. This updated second edition gives you an overview of the Arrow format, highlighting its versatility and benefits through real-world use cases. It guides you through enhancing data science workflows, optimizing performance with Apache Parquet and Spark, and ensuring seamless data translation. You’ll explore data interchange and storage formats, and Arrow's relationships with Parquet, Protocol Buffers, FlatBuffers, JSON, and CSV. You’ll also discover Apache Arrow subprojects, including Flight, SQL, Database Connectivity, and nanoarrow. You’ll learn to streamline machine learning workflows, use Arrow Dataset APIs, and integrate with popular analytical data systems such as Snowflake, Dremio, and DuckDB. The latter chapters provide real-world examples and case studies of products powered by Apache Arrow, providing practical insights into its applications. By the end of this book, you’ll have all the building blocks to create efficient and powerful analytical services and utilities with Apache Arrow.
Choose an application
In today's data-driven world, mastering data engineering is crucial for driving innovation and delivering real business impact. Databricks is one of the most powerful platforms which unifies data, analytics and AI requirements of numerous organizations worldwide. Mastering Data Engineering and Analytics with Databricks goes beyond the basics, offering a hands-on, practical approach tailored for professionals eager to excel in the evolving landscape of data engineering and analytics. This book uniquely blends foundational knowledge with advanced applications, equipping readers with the expertise to build, optimize, and scale data pipelines that meet real-world business needs. With a focus on actionable learning, it delves into complex workflows, including real-time data processing, advanced optimization with Delta Lake, and seamless ML integration with MLflow--skills critical for today's data professionals. Drawing from real-world case studies in FMCG and CPG industries, this book not only teaches you how to implement Databricks solutions but also provides strategic insights into tackling industry-specific challenges. From setting up your environment to deploying CI/CD pipelines, you'll gain a competitive edge by mastering techniques that are directly applicable to your organization's data strategy. By the end, you'll not just understand Databricks--you'll command it, positioning yourself as a leader in the data engineering space.
Choose an application
The breadth of problems that can be solved with data science is astonishing, and this book provides the required tools and skills to a broad audience. The necessary background in computer science, mathematics, and statistics is provided in an approachable manner.
Choose an application
Choose an application
Choose an application
Choose an application
Data warehouse and data mining are essential technologies in the field of data analysis and business intelligence. Data warehouse provides a centralized repository of structured data and facilitates data storage and retrieval. Data mining, on the other hand, utilizes various algorithms and techniques to extract valuable patterns, trends, and insights from large datasets.
Data warehousing. --- Data mining. --- Data warehousing --- Data mining
Choose an application
Choose an application
Listing 1 - 10 of 2443 | << page >> |
Sort by
|