Learning Hadoop 2: design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2
(eBook)

Book Cover
Contributors:
Published:
Birmingham, UK : Packt Publishing, 2015.
Format:
eBook
ISBN:
9781783285525, 1783285524
Physical Desc:
1 online resource (1 volume) : illustrations
Status:
Ebsco (CCU)
Description

If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus.

Copies
Ebsco (CCU)
More Like This
Citations
APA Citation (style guide)

Turkington, G., & Modena, G. (2015). Learning Hadoop 2: design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2. Birmingham, UK, Packt Publishing.

Chicago / Turabian - Author Date Citation (style guide)

Turkington, Garry and Gabriele, Modena. 2015. Learning Hadoop 2: Design and Implement Data Processing, Lifecycle Management, and Analytic Workflows With the Cutting-edge Toolbox of Hadoop 2. Birmingham, UK, Packt Publishing.

Chicago / Turabian - Humanities Citation (style guide)

Turkington, Garry and Gabriele, Modena, Learning Hadoop 2: Design and Implement Data Processing, Lifecycle Management, and Analytic Workflows With the Cutting-edge Toolbox of Hadoop 2. Birmingham, UK, Packt Publishing, 2015.

MLA Citation (style guide)

Turkington, Garry, and Gabriele Modena. Learning Hadoop 2: Design and Implement Data Processing, Lifecycle Management, and Analytic Workflows With the Cutting-edge Toolbox of Hadoop 2. Birmingham, UK, Packt Publishing, 2015.

Note! Citation formats are based on standards as of July 2022. Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy.
More Copies In Prospector
Loading Prospector Copies...
More Details
Language:
English

Notes

General Note
Includes index.
Description
If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus.
Language
English.
Staff View
Grouped Work ID:
d30ad3bd-031b-95cd-7868-0821858159eb
Go To GroupedWork

Record Information

Last File Modification TimeApr 05, 2024 09:38:34 PM
Last Grouped Work Modification TimeApr 05, 2024 09:12:39 PM

MARC Record

LEADER07116cam a2200937 i 4500
001ocn905091883
003OCoLC
00520240329122006.0
006m     o  d        
007cr unu||||||||
008150318s2015    enka    o     001 0 eng d
040 |a UMI|b eng|e rda|e pn|c UMI|d COO|d N$T|d DEBBG|d YDXCP|d OCLCF|d CEF|d UKMGB|d UAB|d AU@|d UKAHL|d SFB|d OCLCO|d OCLCQ|d QGK|d OCLCO|d OCLCL
015 |a GBB6G3008|2 bnb
0167 |a 018006559|2 Uk
019 |a 1259248353
020 |a 9781783285525|q (electronic bk.)
020 |a 1783285524|q (electronic bk.)
020 |z 1783285524
020 |z 1783285516
020 |z 9781783285518
0291 |a DEBBG|b BV042682824
0291 |a DEBSZ|b 446580252
0291 |a GBVCP|b 829881018
0291 |a UKMGB|b 018006559
035 |a (OCoLC)905091883|z (OCoLC)1259248353
037 |a CL0500000568|b Safari Books Online
050 4|a QA76.9.D5
072 7|a COM|x 013000|2 bisacsh
072 7|a COM|x 014000|2 bisacsh
072 7|a COM|x 018000|2 bisacsh
072 7|a COM|x 067000|2 bisacsh
072 7|a COM|x 032000|2 bisacsh
072 7|a COM|x 037000|2 bisacsh
072 7|a COM|x 052000|2 bisacsh
08204|a 004.36|2 23
049 |a MAIN
1001 |a Turkington, Garry,|e author.
24510|a Learning Hadoop 2 :|b design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 /|c Garry Turkington, Gabriele Modena.
2463 |a Learning Hadoop two
24630|a Design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2
264 1|a Birmingham, UK :|b Packt Publishing,|c 2015.
300 |a 1 online resource (1 volume) :|b illustrations
336 |a text|b txt|2 rdacontent
337 |a computer|b c|2 rdamedia
338 |a online resource|b cr|2 rdacarrier
347 |a text file
4901 |a Community experience distilled
5880 |a Online resource; title from cover (Safari, viewed March 10, 2015).
500 |a Includes index.
5050 |a Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Introduction; A note on versioning; The background of Hadoop; Components of Hadoop; Common building blocks; Storage; Computation; Better together; Hadoop 2 -- what's the big deal?; Storage in Hadoop 2; Computation in Hadoop 2; Distributions of Apache Hadoop; A dual approach; AWS -- infrastructure on demand from Amazon; Simple Storage Service (S3); Elastic MapReduce (EMR); Getting started; Cloudera QuickStart VM; Amazon EMR; Creating an AWS account
5058 |a Signing up for the necessary servicesUsing Elastic MapReduce; Getting Hadoop up and running; How to use EMR; AWS credentials; The AWS command-line interface; Running the examples; Data processing with Hadoop; Why Twitter?; Building our first dataset; One service, multiple APIs; Anatomy of a Tweet; Twitter credentials; Programmatic access with Python; Summary; Chapter 2: Storage; The inner workings of HDFS; Cluster startup; NameNode startup; DataNode startup; Block replication; Command-line access to the HDFS filesystem; Exploring the HDFS filesystem; Protecting the filesystem metadata
5058 |a Secondary NameNode not to the rescueHadoop 2 NameNode HA; Keeping the HA NameNodes in sync; Client configuration; How a failover works; Apache ZooKeeper -- a different type of filesystem; Implementing a distributed lock with sequential ZNodes; Implementing group membership and leader election using ephemeral ZNodes; Java API; Building blocks; Further reading; Automatic NameNode failover; HDFS snapshots; Hadoop filesystems; Hadoop interfaces; Java FileSystem API; Libhdfs; Thrift ; Managing and serializing data; The Writable interface; Introducing the wrapper classes ; Array wrapper classes
5058 |a The Comparable and WritableComparable interfacesStoring data; Serialization and Containers; Compression; General-purpose file formats; Column-oriented data formats; RCFile; ORC; Parquet; Avro; Using the Java API; Summary; Chapter 3: Processing -- MapReduce and Beyond; MapReduce; Java API to MapReduce; The Mapper class; The Reducer class; The Driver class; Combiner; Partitioning; The optional partition function; Hadoop-provided mapper and reducer implementations; Sharing reference data; Writing MapReduce programs; Getting started; Running the examples; Local cluster; Elastic MapReduce
5058 |a WordCount, the Hello World of MapReduceWord co-occurrences; Trending topics; The Top N pattern; Sentiment of hashtags; Text cleanup using chain mapper; Walking through a run of a MapReduce job; Startup; Splitting the input; Task assignment; Task startup; Ongoing JobTracker monitoring; Mapper input; Mapper execution; Mapper output and reducer input; Reducer input; Reducer execution; Reducer output; Shutdown; Input/Output; InputFormat and RecordReader; Hadoop-provided InputFormat; Hadoop-provided RecordReader; OutputFormat and RecordWriter; Hadoop-provided OutputFormat; Sequence files; YARN
520 |a If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus.
546 |a English.
63000|a Apache Hadoop.
63007|a Apache Hadoop|2 fast
650 0|a Electronic data processing|x Distributed processing.
650 0|a Big data.
650 6|a Traitement réparti.
650 6|a Données volumineuses.
650 7|a COMPUTERS|x Computer Literacy.|2 bisacsh
650 7|a COMPUTERS|x Computer Science.|2 bisacsh
650 7|a COMPUTERS|x Data Processing.|2 bisacsh
650 7|a COMPUTERS|x Hardware|x General.|2 bisacsh
650 7|a COMPUTERS|x Information Technology.|2 bisacsh
650 7|a COMPUTERS|x Machine Theory.|2 bisacsh
650 7|a COMPUTERS|x Reference.|2 bisacsh
650 7|a Big data|2 fast
650 7|a Electronic data processing|x Distributed processing|2 fast
7001 |a Modena, Gabriele,|e author.
758 |i has work:|a Learning Hadoop 2 (Text)|1 https://id.oclc.org/worldcat/entity/E39PCH6x8qT3fQW4cHJGDfxrG3|4 https://id.oclc.org/worldcat/ontology/hasWork
77608|i Print version:|a Turkington, Garry.|t Learning Hadoop 2 : design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2.|d Birmingham, England ; Mumbai, India : Packt Publishing, ©2015|h x, 360 pages|k Community experience distilled.|z 9781783285518
830 0|a Community experience distilled.
85640|u http://ezproxy.ccu.edu/login?url=https://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&AN=956633
938 |a Askews and Holts Library Services|b ASKH|n AH28256048
938 |a EBSCOhost|b EBSC|n 956633
938 |a YBP Library Services|b YANK|n 12298529
94901|h 9|l cceb|s j|t 188|w EBSCO Academic : External
994 |a 92|b FCX