Programming Elastic MapReduce - Using AWS Services to Build an End-to-End Application - E-book - Multi-format

Edition en anglais

Christopher Phillips

,

Kevin Schmidt

Note moyenne 
Christopher Phillips et Kevin Schmidt - Programming Elastic MapReduce - Using AWS Services to Build an End-to-End Application.
Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started.... Lire la suite
25,99 € E-book - Multi-format
Vous pouvez lire cet ebook sur les supports de lecture suivants :
Bientôt disponible
Recevez un email dès que l'ouvrage est disponible

Résumé

Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application.
Using code samples and example configurations, you'll learn how to assemble the building blocks necessary to solve your biggest data analysis problems. - Get an overview of the AWS and Apache software tools used in large-scale data analysis - Go through the process of executing a Job Flow with a simple log analyzer - Discover useful MapReduce patterns for filtering and analyzing data sets - Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow - Learn the basics for using Amazon EMR to run machine learning algorithms - Develop a project cost model for using Amazon EMR and other AWS tools

Caractéristiques

  • Date de parution
    10/12/2013
  • Editeur
    O'Reilly Media
  • ISBN
    978-1-4493-6359-8
  • EAN
    9781449363598
  • Format
    Multi-format
  • Nb. de pages
    174 pages
  • Caractéristiques du format Multi-format
    • Pages
      174
  • Caractéristiques du format PDF
    • Protection num.
      pas de protection
  • Caractéristiques du format ePub
    • Protection num.
      pas de protection
  • Caractéristiques du format Mobipocket
    • Protection num.
      pas de protection
  • Caractéristiques du format Streaming
    • Protection num.
      pas de protection

Avis libraires et clients

Avis audio

Écoutez ce qu'en disent nos libraires !

À propos des auteurs

Kevin J. Schmidt is a senior manager at Dell SecureWorks, Inc., an industry leading MSSP, which is part of Dell. He is responsible for the design and development of a major part of the company's SIEM platform. This includes data acquisition, correlation, and analysis of log data. Prior to SecureWorks, Kevin worked for Reflex Security, where he worked on an IPS engine and anti-virus software. And prior to this, he was a lead developer and architect at GuardedNet, Inc., which built one of the industry's first SIEM platforms. He is also a commissioned officer in the United States Navy Reserve (USNR).
He has over 19 years of experience in software development and design, 11 of which have been in the network security space. He holds a Bachelor of Science in Computer Science. Kevin has spent time designing cloud services components at Dell, including virtualized components to run in Dell's own vCloud. These components are used to protect customers who use Dell's cloud infrastructure. Additionally, he has been working with Hadoop, machine learning, and other technology in the cloud. Kevin is co-author of Essential SNMP, second edition (O'Reilly and Associates, ISBN: 978-0-596-00840-6) and also Logging and Log Management: The Authoritative Guide to Understanding the Concepts Surrounding Logging and Log Management (Syngress, ISBN: 978-1-597-49635-3). Christopher Phillips is a manager and senior software developer at Dell SecureWorks, Inc, an industry leading MSSP, which is part of Dell.
He is responsible for the design and development of the company's Threat Intelligence service platform. He also has responsibility for a team involved in integrating log and event information from many third-party providers that allow customers to have all of their core security information delivered to and analyzed by the Dell SecureWorks systems and security professionals. Prior to Dell SecureWorks, Chris worked for McKesson and Allscripts, where he worked with clients on HIPAA compliance, security, and healthcare systems integration.
He has over 18 years of experience in software development and design. He holds a Bachelor of Science in Computer Science and an MBA. Chris has spent time designing and developing virtualization and cloud Infrastructure as a Service strategies at Dell to help our security services scale globally Additionally, he has been working with Hadoop, Pig scripting languages, and Amazon Elastic Map Reduce to develop strategies to gain insights and analyze Big Data issues in the cloud. Chris is co-author of Logging and Log Management: The Authoritative Guide to Understanding the Concepts Surrounding Logging and Log Management (Syngress, ISBN: 978-1-597-49635-3).

Derniers produits consultés

Programming Elastic MapReduce - Using AWS Services to Build an End-to-End Application est également présent dans les rayons