In this blog we will learn basic concept all about big data and spark, and how to install it and how to start work.
Codersarts Assignment Help is a top rated professional website when we talk about programming help, project help, homework help and other computer science related assignments.
Big Data Assignment is highly professional concept with combination of Information technology and business management and the best use of large volume of electronic data. Students are not able to do the Big data assignment and its solution in effectively and we come with service to give you support. Here we have provide efficient way to handle big data assignment so you can use it in our business management.
What is Big Data ?
Big data is data that contains greater variety arriving in increasing volumes and with ever-higher velocity or we can say big data is technology which used to handle large volume of data.
We can define it many way like :
Big data is larger, more complex data sets
Big Data Define as - Lots of data , Big data is the term for a collection of data, etc.
Characteristics of Big Data
Volume : With big data, you’ll have to process high volumes of low-density, unstructured data.
Velocity : Velocity is the fast rate at which data is received and (perhaps) acted on.
Variety : The data that is stored can be in variable format. It can be in the form of text, diagram, graph audio etc.
Uses of Big Data
Product Development
Predictive Maintenance
Predictive Maintenance
Fraud and Compliance
Machine Learning
Operational Efficiency
Drive Innovation
Analytics
How it works
It involves three key :
Integrate : It use traditional data integration mechanisms, such as ETL (extract, transform, and load) generally aren’t up to the task.
Manage : Big data requires storage. Your storage solution can be in the cloud, on premises, or both . So you need to handle own storage.
Analyze : Your investment in big data pays off when you analyze and act on your data.
How to install Big data Hadoop it in windows
Here we will provide complete configure zip file so you can easily install it in the widow without set configuration. So please follow process to install it in window because it is difficult and no proper solution find to install it in windows on the internet.
Steps 1:
Check either Java 1.8.0 is already installed on your system or not, use "Javac -version" to check.
Step 2 :
If Java is not installed on your system then first install java under "C:\JAVA"
Step 3 :
Extract file Hadoop 2.8.0.tar.gz or Hadoop-2.8.0.zip and place under "C:\Hadoop-2.8.0".
Step 4 :
Set the path HADOOP_HOME Environment variable on windows 10(see Step 1,2,3 below).
Step 5 :
Set the path JAVA_HOME Environment variable on windows 10(see Step 1,2,3 below).
Step 6 :
Next we set the Hadoop bin directory path and JAVA bin directory path.
Configuration
You need to configure by editing four file, but here we also attached already configure file so you need to run directly without configure it.
Download Hadoop 2.8.0 (Link: http://www-eu.apache.org/dist/hadoop/common/hadoop-2.8.0/hadoop-2.8.0.tar.gz OR http://archive.apache.org/dist/hadoop/core//hadoop-2.8.0/hadoop-2.8.0.tar.gz)
Java JDK 1.8.0.zip (Link: http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html)
Replace bin in hadoop by this configure file :
Dowload file Hadoop Configuration.zip (Link: https://github.com/MuhammadBilalYar/HADOOP-INSTALLATION-ON-WINDOW-10/blob/master/Hadoop%20Configuration.zip)
Now it ready to run
Open cmd and typing command "hdfs namenode –format"
then new name node window open
To test run all window :
Open cmd and change directory to "C:\Hadoop-2.8.0\sbin" and type "start-all.cmd" to start apache.
Output look like that:
Now it run properly.
Other Codersarts services
If you like Codersarts blog and looking for Assignment help,Project help, Programming tutors help and suggestion you can send mail at contact@codersarts.com.
Please write your suggestion in comment section below if you find anything incorrect in this blog post
Comments