/home/leansigm/public_html/components/com_easyblog/services

SigmaWay Blog

SigmaWay Blog tries to aggregate original and third party content for the site users. It caters to articles on Process Improvement, Lean Six Sigma, Analytics, Market Intelligence, Training ,IT Services and industries which SigmaWay caters to

Spark or Hadoop Which is a better Big Data framework?

Hadoop, for many years, was the leading open source Big Data framework but recently the newer and more advanced Spark has taken over. Spark is reported to be 100 times faster although it lacks its own distributed storage system. For this reason many projects involve installing Spark on top of Hadoop, where Spark’s advanced analytics can make use of data stored using the Hadoop Distributed File System (HDFS).
What really gives Spark the edge is speed. Spark handles most of its operations ‘in memory’- copying them from the distributed physical storage into far faster logical RAM memory. Spark’s speed of handling advanced data processing tasks such as real time stream processing and machine learning is much more than what could be achieved by Hadoop. Faster dynamic data handling gives Spark the upper hand over Hadoop.
However it must be concluded that these two frameworks are not necessarily mutually exclusive and do not perform exactly the same tasks. In fact using both of them together can actually provide better results than using either one separately.

For more information visit:
http://www.forbes.com/sites/bernardmarr/2015/06/22/spark-or-hadoop-which-is-the-best-big-data-framework/

 

 

Rate this blog entry:
4956 Hits
0 Comments

Spark or Hadoop Which is a better Big Data framework?

Hadoop, for many years, was the leading open source Big Data framework but recently the newer and more advanced Spark has taken over. Spark is reported to be 100 times faster although it lacks its own distributed storage system. For this reason many projects involve installing Spark on top of Hadoop, where Spark’s advanced analytics can make use of data stored using the Hadoop Distributed File System (HDFS).
What really gives Spark the edge is speed. Spark handles most of its operations ‘in memory’- copying them from the distributed physical storage into far faster logical RAM memory. Spark’s speed of handling advanced data processing tasks such as real time stream processing and machine learning is much more than what could be achieved by Hadoop. Faster dynamic data handling gives Spark the upper hand over Hadoop.
However it must be concluded that these two frameworks are not necessarily mutually exclusive and do not perform exactly the same tasks. In fact using both of them together can actually provide better results than using either one separately.

For more information visit:
http://www.forbes.com/sites/bernardmarr/2015/06/22/spark-or-hadoop-which-is-the-best-big-data-framework/

 

 

 

Rate this blog entry:
4284 Hits
0 Comments

Latest Revolution In Big Data Analytics- An insight

In today`s business world, time is a precious resource, so there is increasing demand for a faster system of data analysis. An emerging open source analytics tool called spark is slowly gaining popularity and is being used for getting faster results obtained from big data analysis. Earlier many companies used and relied on Hadoop for processing and analyzing information, which used to take days to arrive at a meaningful inference. The spark analytics technology uses a completely different technique for storing data than Hadoop, with spark data is directly written in the memory rather than storing in disks, which increases the speed of data analysis remarkably.  There are also other interesting technological features for which spark is gaining attention.To know more read:  http://blogs.wsj.com/cio/2015/06/03/spark-a-tool-at-big-datas-cutting-edge-helps-under-armour-perform-faster-analytics/?KEYWORDS=analytics

 

Rate this blog entry:
4793 Hits
0 Comments
Sign up for our newsletter

Follow us