Dowload Lastest Books On Hadoop

Amazon Best Deal !!

Powered by Blogger.

Sunday, January 1, 2017

Introduction To Apache Flume


Introduction To Apache Flume

 


What is Flume?
Apache Flume is a tool/service/data ingestion mechanism for collecting aggregating and transporting large amounts of streaming data such as log files, events (etc...) from various sources to a centralized data store.
Flume is a highly reliable, distributed, and configurable tool. It is principally designed to copy streaming data (log data) from various web servers to HDFS.

Assume an e-commerce web application wants to analyze the customer behavior from a particular region. To do so, they would need to move the available log data into Hadoop for analysis. Here, Apache Flume comes to our rescue.
Flume is used to move the log data generated by application servers into HDFS at a higher speed.
Advantages of Flume:-
·       Using Apache Flume we can store the data into any of the centralized stores (HBase, HDFS).
·        When the rate of incoming data exceeds the rate at which data can be written to the destination, Flume acts as a mediator between data producers and the centralized stores and provides a steady flow of data between them.
·        Flume provides the feature of contextual routing.
·        The transactions in Flume are channel-based where two transactions (one sender and one receiver) are maintained for each message. It guarantees reliable message delivery.
·  Flume is reliable, fault tolerant, scalable, manageable, and customizable.
Features of Flume:-
·        Flume ingests log data from multiple web servers into a centralized store (HDFS, HBase) efficiently.
·        Using Flume, we can get the data from multiple servers immediately into Hadoop.
·        Along with the log files, Flume is also used to import huge volumes of event data produced by social networking sites like Facebook and Twitter, and e-commerce websites like Amazon and Flipkart.
·        Flume supports a large set of sources and destinations types.
·  Flume supports multi-hop flows, fan-in fan-out flows, contextual routing, etc.
·        Flume can be scaled horizontally.

13 comments:

  1. The blog gave me idea about apache flume My sincere thanks for sharing this post and please continue to share this kind of post
    Hadoop Training in Chennai

    ReplyDelete
  2. The explanations about the hadoop was very much useful My sincere Thanks for sharing this post and please continue to share this kind of post
    hadoop training in chennai

    ReplyDelete
  3. Very nice post here and thanks for it .I always like and such a super contents of these post.Excellent and very cool idea and great content of different kinds of the valuable information's.



    Hadoop Training in BTM Layout


    Hadoop Training in Marathahalli

    ReplyDelete
  4. This comment has been removed by the author.

    ReplyDelete
  5. This comment has been removed by the author.

    ReplyDelete
  6. It is nice blog Thank you porovide importent information and i am searching for same information to save my timeBig data hadoop online Course Hyderabad

    ReplyDelete
  7. Great Information sharing .. I am very happy to read this article .. thanks for giving us go through info.Fantastic nice. I appreciate this post. Data Blending in Tableau

    ReplyDelete
  8. cool stuff. thank you for sharing useful and productive information.

    data science training in noida

    ReplyDelete
  9. I’m excited to uncover this page. I need to to thank you for ones time for this particularly fantastic read !! I definitely really liked every part of it and i also have you saved to fav to look at new information in your site.
    data science certification

    ReplyDelete
  10. Through this post, I realize that your great information in playing with all the pieces was exceptionally useful. I advise this is the primary spot where I discover issues I've been scanning for. You have a smart yet alluring method of composing.
    certification of data science

    ReplyDelete
  11. This comment has been removed by the author.

    ReplyDelete