Hadoop: Data Analysis

Hadoop: Data Analysis

Website or Online Data - 2016
Rate this:
Hadoop is the cloud computing platform data scientists use to perform highly parallelized operations on big data. If you've explored Hadoop, you've probably discovered it has many levels of complexity. After getting comfortable with the fundamentals, you're ready to see how to put additional frameworks and tool sets to use. In this course, software engineer and data scientist Jack Dintruff goes beyond the basic capabilities of Hadoop. He demonstrates hands-on, project-based, practical skills for analyzing data, including how to use Pig to analyze large datasets and how to use Hive to manage large datasets in distributed storage. Learn how to configure the Hadoop distributed file system (HDFS), perform processing and ingestion using MapReduce, copy data from cluster to cluster, create data summarizations, and compose queries.
Learn how to use Hadoop utilities to set up, manage, and analyze large and distributed datasets. Learn how to work with HDFS, YARN, MapReduce, Hive, and Pig.
Publisher: Carpenteria, CA : lynda.com, 2016
Copyright Date: ©2016
Additional Contributors: lynda.com (Firm)
Call Number: eResearch

Opinion

From the critics


Community Activity

Comment

Add a Comment

There are no comments for this title yet.

Age

Add Age Suitability

There are no ages for this title yet.

Summary

Add a Summary

There are no summaries for this title yet.

Notices

Add Notices

There are no notices for this title yet.

Quotes

Add a Quote

There are no quotes for this title yet.

Explore Further

Recommendations

Subject Headings

  Loading...

Find it at DCL

  Loading...
[]
[]
To Top