{"id":46675,"date":"2023-09-15T01:11:41","date_gmt":"2024-02-18T10:14:54","guid":{"rendered":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/"},"modified":"2024-05-03T23:47:34","modified_gmt":"2024-05-03T15:47:34","slug":"46675-2","status":"publish","type":"post","link":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/","title":{"rendered":""},"content":{"rendered":"<p>Apache Flume\u3084Apache Kafka\u306f\u30ea\u30a2\u30eb\u30bf\u30a4\u30e0\u306a\u30a4\u30d9\u30f3\u30c8\u51e6\u7406\u306e\u30d0\u30c3\u30af\u30a8\u30f3\u30c9\u3068\u3057\u3066\u5e83\u304f\u5229\u7528\u3055\u308c\u3066\u3044\u307e\u3059\u3002\u3053\u308c\u3089\uff12\u3064\u306e\u30b7\u30b9\u30c6\u30e0\u306f\u4f3c\u3066\u3044\u308b\u90e8\u5206\u3082\u3042\u308a\u307e\u3059\u304c\u3001\u30e6\u30fc\u30b9\u30b1\u30fc\u30b9\u306b\u3088\u308a\u3069\u3061\u3089\u304b\u4e00\u65b9\u3001\u3042\u308b\u3044\u306f\u91cf\u3092\u7d44\u307f\u5408\u308f\u305b\u3066\u4f7f\u3046\u5834\u5408\u3082\u3042\u308a\u307e\u3059\u3002<\/p>\n<p>Flume\u3068Kafka\u306e\u9055\u3044\u306f\u6b21\u306e\u30d6\u30ed\u30b0\u3082\u53c2\u8003\u306b\u306a\u308a\u307e\u3059\u3002<br \/>\nhttps:\/\/www.linkedin.com\/pulse\/flume-kafka-real-time-event-processing-lan-jiang<\/p>\n<h1>Apache Kafka<\/h1>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/3-0.png\" alt=\"kafka_diagram.png\" \/><\/div>\n<p>\u3057\u304b\u3057\u3001Kafka\u3092\u4f7f\u3046\u5834\u5408\u3001\u4e00\u822c\u7684\u306b\u30d7\u30ed\u30c7\u30e5\u30fc\u30b5\u3084\u30b3\u30f3\u30b7\u30e5\u30fc\u30de\u306e\u305f\u3081\u306e\u30b3\u30fc\u30c9\u3092\u8a18\u8ff0\u3059\u308b\u5fc5\u8981\u304c\u3042\u308a\u307e\u3059\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Producer\u306e\u30b3\u30fc\u30c9\u306e\u4f8b (https:\/\/github.com\/bkimminich\/apache-kafka-book-examples\/blob\/master\/src\/test\/kafka\/SimpleProducer.java)<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Consumer\u306e\u30b3\u30fc\u30c9\u306e\u4f8b<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">(https:\/\/github.com\/bkimminich\/apache-kafka-book-examples\/tree\/master\/src\/test\/kafka\/consumer)<\/ul>\n<p>kafka-topics.sh\u3084kafka-console-producer.sh\u306e\u3088\u3046\u306a\u30e6\u30fc\u30c6\u30a3\u30ea\u30c6\u30a3\u30b3\u30de\u30f3\u30c9\u3092\u4f7f\u7528\u3057\u3066\u30b3\u30de\u30f3\u30c9\u30e9\u30a4\u30f3\u304b\u3089Kafka\u3092\u5229\u7528\u3059\u308b\u3053\u3068\u3082\u3067\u304d\u307e\u3059\u304c\u3001\u90fd\u5ea6\u30b3\u30de\u30f3\u30c9\u3092\u53e9\u304f\u306e\u306f\u96e3\u3057\u3044\u3067\u3059\u3057\u3001\u30a2\u30d7\u30ea\u30b1\u30fc\u30b7\u30e7\u30f3\u3068\u9023\u643a\u3059\u308b\u5834\u5408\u306f\u30b3\u30fc\u30c9\u3092\u8a18\u8ff0\u3059\u308b\u3053\u3068\u306b\u306a\u308b\u3067\u3057\u3087\u3046\u3002<\/p>\n<p>\u3057\u304b\u3057\u3001Flafka\u3092\u4f7f\u3048\u3070\u3001\u30b3\u30fc\u30c9\u3092\u8a18\u8ff0\u3059\u308b\u3053\u3068\u306a\u304fKafka\u3068\u9023\u643a\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002<\/p>\n<h1>Flafka\u3068\u306f\uff1f<\/h1>\n<p>Flafka\u306fFlume\u3068Kafka\u9023\u643a\u306e\u4fd7\u540d\uff08\uff1f\uff09\u3067\u3059\u3002Kafka\u3092Flume\u306e\u30bd\u30fc\u30b9\uff08\u5165\u529b\uff09\u3084\u30b7\u30f3\u30af\uff08\u51fa\u529b\uff09\u3001\u307e\u305f\u306f\u30c1\u30e3\u30f3\u30cd\u30eb\uff08\u30d0\u30c3\u30d5\u30a1\uff09\u3068\u3057\u3066\u5229\u7528\u3059\u308b\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002\u3064\u307e\u308a\u3001Flume\u306e\u30d7\u30ed\u30d1\u30c6\u30a3\u30d5\u30a1\u30a4\u30eb\u306bKafka\u306e\u8a2d\u5b9a\u3092\u884c\u3046\u3060\u3051\u3067\u3001\u30b3\u30fc\u30c9\u3092\u5229\u7528\u305b\u305a\u306b\u9023\u643a\u3067\u304d\u308b\u3068\u3044\u3046\u3053\u3068\u3067\u3059\u3002\u3068\u3063\u3066\u3082\u7c21\u5358\u3002<\/p>\n<p>2016\/10\/25\u88dc\u8db3: \u4e0b\u8a18\u306fFlume 1.6\u3067\u306e\u8a2d\u5b9a\u3067\u3059\u3002Flume1.7\u3067\u306fKafka 0.9\u5bfe\u5fdc\u306e\u305f\u3081\u3001\u30d7\u30ed\u30d1\u30c6\u30a3\u306e\u8a18\u8ff0\u65b9\u6cd5\u304c\u5909\u66f4\u3055\u308c\u3066\u3044\u307e\u3059\u30021<\/p>\n<h2>Kafka\u306e\u30c8\u30d4\u30c3\u30af\u306b\u30c7\u30fc\u30bf\u3092\u66f8\u304d\u51fa\u3059 (Kafka Sink)<\/h2>\n<p>Flume\u306e\u3055\u307e\u3056\u307e\u306a\u30c7\u30fc\u30bf\u30bd\u30fc\u30b9\uff08\u30d5\u30a1\u30a4\u30eb\u306etail\u3001\u3042\u308b\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u306b\u51fa\u529b\u3055\u308c\u305f\u30d5\u30a1\u30a4\u30eb\u3001twitter\u306a\u3069\uff09\u3092Kafka\u306b\u53d6\u308a\u8fbc\u3080\u4f8b\u3067\u3059\u3002Flume\u306e\u30c7\u30fc\u30bf\u30bd\u30fc\u30b9\u3084\u30b7\u30f3\u30af\u306e\u7d30\u304b\u3044\u8a2d\u5b9a\u306f\u30e6\u30fc\u30b6\u30fc\u30ac\u30a4\u30c9\u3092\u53c2\u7167<br \/>\nhttps:\/\/flume.apache.org\/FlumeUserGuide.html<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/13-0.jpeg\" alt=\"flafka1.jpg\" \/><\/div>\n<h3>Kafka Sink\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb<\/h3>\n<p>Flume\u306espoolDir\u3092\u4f7f\u3046\u3068\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u3092\u76e3\u8996\u3057\u3066\u3001\u3053\u306e\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u306b\u8ffd\u52a0\u3055\u308c\u305f\u30d5\u30a1\u30a4\u30eb\u306e\u5185\u5bb9\u3092\uff11\u884c\u6bce\u306b\u30ec\u30b3\u30fc\u30c9\u3068\u3057\u3066\u53d6\u308a\u8fbc\u307f\u307e\u3059\u3002\u307e\u305f\u3001\u30b7\u30f3\u30af\u306e\u8a2d\u5b9a\u3067Kafka\u306e\u30c8\u30d4\u30c3\u30af\u3092\u6307\u5b9a\u3057\u3066\u3044\u307e\u3059\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u76e3\u8996\u30c7\u30a3\u30ec\u30af\u30c8\u30ea: \/flume\/weblogs<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">Kafka\u306e\u30c8\u30d4\u30c3\u30af: eventtopic<\/ul>\n<p>\u3053\u306e\u5834\u5408\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb(spooldir_sample.conf)\u306f\u4ee5\u4e0b\u306e\u3088\u3046\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"c\"># \u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u30b3\u30f3\u30dd\u30fc\u30c8\u306e\u540d\u524d\r\n<\/span><span class=\"py\">agent.sources<\/span> <span class=\"p\">=<\/span> <span class=\"s\">webserver-log-source<\/span>\r\n<span class=\"py\">agent.sinks<\/span> <span class=\"p\">=<\/span> <span class=\"s\">kafka-sink<\/span>\r\n<span class=\"py\">agent.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u30bd\u30fc\u30b9\u306e\u8a2d\u5b9a\u3002\/flume\/weblogs\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u306b\u66f8\u304b\u308c\u305f\u30d5\u30a1\u30a4\u30eb\u306e\u5185\u5bb9\u3092Kafka\u306e\u30c8\u30d4\u30c3\u30af\u306b\u51fa\u529b\u3055\u305b\u308b\r\n<\/span><span class=\"py\">agent.sources.webserver-log-source.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">spooldir<\/span>\r\n<span class=\"py\">agent.sources.webserver-log-source.spoolDir<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\/flume\/weblogs<\/span>\r\n<span class=\"py\">agent.sources.webserver-log-source.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u51fa\u529b\u3092Kafka\u306eeventtopic\u30c8\u30d4\u30c3\u30af\u306b\u3059\u308b\r\n<\/span><span class=\"py\">agent.sinks.kafka-sink.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">org.apache.flume.sink.kafka.KafkaSink<\/span>\r\n<span class=\"py\">agent.sinks.kafka-sink.topic<\/span> <span class=\"p\">=<\/span> <span class=\"s\">eventtopic<\/span>\r\n<span class=\"py\">agent.sinks.kafka-sink.brokerList<\/span> <span class=\"p\">=<\/span> <span class=\"s\">localhost:9092<\/span>\r\n<span class=\"py\">agent.sinks.kafka-sink.batchSize<\/span> <span class=\"p\">=<\/span> <span class=\"s\">20<\/span>\r\n<span class=\"py\">agent.sinks.kafka-sink.channel<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n\r\n\r\n<span class=\"c\"># Flume\u306e\u30d0\u30c3\u30d5\u30a1\u306f\u30e1\u30e2\u30ea\r\n<\/span><span class=\"py\">agent4.channels.memory-channel.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory<\/span>\r\n<span class=\"py\">agent4.channels.memory-channel.capacity<\/span> <span class=\"p\">=<\/span> <span class=\"s\">100000<\/span>\r\n<span class=\"py\">agent4.channels.memory-channel.transactionCapacity<\/span> <span class=\"p\">=<\/span> <span class=\"s\">1000<\/span>\r\n<\/code><\/pre>\n<h3>Flume\u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u5b9f\u884c\u4f8b<\/h3>\n<p>\u4e0b\u8a18\u306e\u30b3\u30de\u30f3\u30c9\u3092\u5b9f\u884c\u3059\u308b\u3068\u3001\/flume\/weblogs\u306b\u30d5\u30a1\u30a4\u30eb\u304c\u8ffd\u52a0\u3055\u308c\u308b\u6bce\u306b\u3001Kafka\u306e\u30c8\u30d4\u30c3\u30af(eventtopic)\u306b\u30c7\u30fc\u30bf\u3092\u9001\u4fe1\u3057\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"nv\">$ <\/span>flume-ng agent <span class=\"nt\">--conf<\/span> \/etc\/flume-ng\/conf \u00a5\r\n<span class=\"nt\">--conf-file<\/span> \/home\/kawasaki\/spooldir_sample.conf \u00a5\r\n<span class=\"nt\">--name<\/span> agent \u00a5\r\n<span class=\"nt\">-Dflume<\/span>.root.logger<span class=\"o\">=<\/span>INFO,console\r\n<\/code><\/pre>\n<h2>Kafka\u306e\u30c8\u30d4\u30c3\u30af\u304b\u3089\u30c7\u30fc\u30bf\u3092\u8aad\u307f\u8fbc\u3093\u3067\u51fa\u529b\u3059\u308b (Kafka Source)<\/h2>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/23-0.jpeg\" alt=\"flafka2.jpg\" \/><\/div>\n<h3>Kafka Source\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb<\/h3>\n<p>Flume\u306eKafka Source\u53ca\u3073hdfs-sink\u3092\u4f7f\u3046\u3068\u3001Kafka\u306e\u30c8\u30d4\u30c3\u30af\u304b\u3089\u53d6\u308a\u8fbc\u3093\u3060\u30c7\u30fc\u30bf\u3092HDFS\u306b\u51fa\u529b\u3057\u307e\u3059\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">Kafka\u306e\u30c8\u30d4\u30c3\u30af: eventtopic<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">HDFS\u306e\u51fa\u529b\u30c7\u30a3\u30ec\u30af\u30c8\u30ea: \/user\/kawasaki\/hdfsstore<\/ul>\n<p>\u3053\u306e\u5834\u5408\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb(kafka_hdfs.conf)\u306f\u4ee5\u4e0b\u306e\u3088\u3046\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"c\"># \u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u30b3\u30f3\u30dd\u30fc\u30c8\u540d\r\n<\/span><span class=\"py\">agent.sources<\/span> <span class=\"p\">=<\/span> <span class=\"s\">kafka-source<\/span>\r\n<span class=\"py\">agent.sinks<\/span> <span class=\"p\">=<\/span> <span class=\"s\">hdfs-sink<\/span>\r\n<span class=\"py\">agent.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n\r\n<span class=\"c\"># Kafka\u3092Flume\u306e\u30bd\u30fc\u30b9\u306b\u3059\u308b (Kafka\u306e\u30c8\u30d4\u30c3\u30af\u306feventtopic)\r\n<\/span><span class=\"py\">agent.sources.kafka-source.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">org.apache.flume.source.kafka.KafkaSource<\/span>\r\n<span class=\"py\">agent2.sources.kafka-source.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n<span class=\"py\">agent2.sources.kafka-source.zookeeperConnect<\/span> <span class=\"p\">=<\/span> <span class=\"s\">localhost:2181<\/span>\r\n<span class=\"py\">agent2.sources.kafka-source.topic<\/span> <span class=\"p\">=<\/span> <span class=\"s\">eventtopic<\/span>\r\n<span class=\"py\">agent2.sources.kafka-source.groupId<\/span> <span class=\"p\">=<\/span> <span class=\"s\">flume<\/span>\r\n<span class=\"py\">agent2.sources.kafka-source.kafka.consumer.timeout.ms<\/span> <span class=\"p\">=<\/span> <span class=\"s\">100<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u51fa\u529b\u3092HDFS\u306b\u3059\u308b\r\n<\/span><span class=\"py\">agent.sinks.hdfs-sink.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">hdfs<\/span>\r\n<span class=\"py\">agent.sinks.hdfs-sink.hdfs.path<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\/user\/kawasaki\/hdfsstore<\/span>\r\n<span class=\"py\">agent.sinks.hdfs-sink.channel<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory-channel<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u30d0\u30c3\u30d5\u30a1\u306f\u30e1\u30e2\u30ea\u3092\u4f7f\u3046\r\n<\/span><span class=\"py\">agent.channels.memory-channel.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">memory<\/span>\r\n<span class=\"py\">agent.channels.memory-channel.capacity<\/span> <span class=\"p\">=<\/span> <span class=\"s\">100000<\/span>\r\n<span class=\"py\">agent.channels.memory-channel.transactionCapacity<\/span> <span class=\"p\">=<\/span> <span class=\"s\">1000<\/span>\r\n<\/code><\/pre>\n<h3>Flume\u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u5b9f\u884c\u4f8b<\/h3>\n<p>\u4e0b\u8a18\u306e\u30b3\u30de\u30f3\u30c9\u3092\u5b9f\u884c\u3059\u308b\u3068\u3001Kafka\u306e\u30c8\u30d4\u30c3\u30af(eventtopic)\u304b\u3089\u30c7\u30fc\u30bf\u3092\u53d6\u308a\u51fa\u3057\u3066HDFS\u306b\u66f8\u304d\u8fbc\u307f\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"nv\">$ <\/span>flume-ng agent <span class=\"nt\">--conf<\/span> \/etc\/flume-ng\/conf \u00a5\r\n<span class=\"nt\">--conf-file<\/span> \/home\/kawasaki\/kafka_hdfs.conf \u00a5\r\n<span class=\"nt\">--name<\/span> agent \u00a5\r\n<span class=\"nt\">-Dflume<\/span>.root.logger<span class=\"o\">=<\/span>INFO,console\r\n<\/code><\/pre>\n<h2>Flume\u306e\u30c1\u30e3\u30f3\u30cd\u30eb\u3068\u3057\u3066Kafka\u3092\u4f7f\u7528\u3059\u308b(Kafka Channel)<\/h2>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/33-0.jpeg\" alt=\"flafka3.jpg\" \/><\/div>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/34-0.jpeg\" alt=\"flafka4.jpg\" \/><\/div>\n<h3>Kafka Channel\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb<\/h3>\n<p>Flume\u306eKafka Channel\u306e\u4f8b<br \/>\n* \u76e3\u8996\u30c7\u30a3\u30ec\u30af\u30c8\u30ea: \/flume\/weblogs<br \/>\n* HDFS\u306e\u51fa\u529b\u30c7\u30a3\u30ec\u30af\u30c8\u30ea: \/user\/kawasaki\/hdfsstore<br \/>\n* Kafka\u306e\u30c8\u30d4\u30c3\u30af: eventtopic<\/p>\n<p>\u3053\u306e\u5834\u5408\u306e\u8a2d\u5b9a\u30d5\u30a1\u30a4\u30eb(kafka_channel.conf)\u306f\u4ee5\u4e0b\u306e\u3088\u3046\u306b\u306a\u308a\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"c\"># \u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u30b3\u30f3\u30dd\u30fc\u30c8\u540d\r\n<\/span><span class=\"py\">agent.sources<\/span> <span class=\"p\">=<\/span> <span class=\"s\">webserver-log-source<\/span>\r\n<span class=\"py\">agent.sinks<\/span> <span class=\"p\">=<\/span> <span class=\"s\">hdfs-sink<\/span>\r\n<span class=\"py\">agent.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">kafka-channel<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u30bd\u30fc\u30b9\u306e\u8a2d\u5b9a\u3002\/flume\/weblogs\u30c7\u30a3\u30ec\u30af\u30c8\u30ea\u306b\u66f8\u304b\u308c\u305f\u30d5\u30a1\u30a4\u30eb\u306e\u5185\u5bb9\u3092Kafka\u306e\u30c8\u30d4\u30c3\u30af\u306b\u51fa\u529b\u3055\u305b\u308b\r\n<\/span><span class=\"py\">agent.sources.webserver-log-source.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">spooldir<\/span>\r\n<span class=\"py\">agent.sources.webserver-log-source.spoolDir<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\/flume\/weblogs<\/span>\r\n<span class=\"py\">agent.sources.webserver-log-source.channels<\/span> <span class=\"p\">=<\/span> <span class=\"s\">kafka-channel<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u51fa\u529b\u3092HDFS\u306b\u3059\u308b\r\n<\/span><span class=\"py\">agent.sinks.hdfs-sink.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">hdfs<\/span>\r\n<span class=\"py\">agent.sinks.hdfs-sink.hdfs.path<\/span> <span class=\"p\">=<\/span> <span class=\"s\">\/user\/kawasaki\/hdfsstore<\/span>\r\n<span class=\"py\">agent.sinks.hdfs-sink.channel<\/span> <span class=\"p\">=<\/span> <span class=\"s\">kafka-channel<\/span>\r\n<span class=\"py\">agent.sinks.hdfs-sink.hdfs.fileType<\/span> <span class=\"p\">=<\/span> <span class=\"s\">DataStream<\/span>\r\n\r\n<span class=\"c\"># Flume\u306e\u30d0\u30c3\u30d5\u30a1\uff08\u30c1\u30e3\u30f3\u30cd\u30eb\uff09\u306fKafka\u306b\u3059\u308b\r\n<\/span><span class=\"py\">agent.channels.kafka-channel.type<\/span> <span class=\"p\">=<\/span> <span class=\"s\">org.apache.flume.channel.kafka.KafkaChannel<\/span>\r\n<span class=\"py\">agent.channels.kafka-channel.brokerList<\/span> <span class=\"p\">=<\/span> <span class=\"s\">localhost:9092<\/span>\r\n<span class=\"py\">agent.channels.kafka-channel.zookeeperConnect<\/span> <span class=\"p\">=<\/span> <span class=\"s\">localhost:2181<\/span>\r\n<span class=\"py\">agent.channels.kafka-channel.topic<\/span> <span class=\"p\">=<\/span> <span class=\"s\">eventtopic<\/span>\r\n<\/code><\/pre>\n<h3>Flume\u30a8\u30fc\u30b8\u30a7\u30f3\u30c8\u306e\u5b9f\u884c\u4f8b<\/h3>\n<p>\u4e0b\u8a18\u306e\u30b3\u30de\u30f3\u30c9\u3092\u5b9f\u884c\u3059\u308b\u3068\u3001\/flume\/weblogs\u306b\u66f8\u304d\u8fbc\u307e\u308c\u305f\u30d5\u30a1\u30a4\u30eb\u3092\u8aad\u307f\u51fa\u3057\u3001HDFS\u306b\u66f8\u304d\u8fbc\u307f\u307e\u3059\u3002<\/p>\n<pre class=\"post-pre\"><code><span class=\"nv\">$ <\/span>flume-ng agent <span class=\"nt\">--conf<\/span> \/etc\/flume-ng\/conf \u00a5\r\n<span class=\"nt\">--conf-file<\/span> \/home\/kawasaki\/kafka_channel.conf \u00a5\r\n<span class=\"nt\">--name<\/span> agent \u00a5\r\n<span class=\"nt\">-Dflume<\/span>.root.logger<span class=\"o\">=<\/span>INFO,console\r\n<\/code><\/pre>\n<h2>\u5fdc\u7528<\/h2>\n<p>Kafka\u3068Flume\u3092\u3092\u7d44\u307f\u5408\u308f\u305b\u308b\u3053\u3068\u3067\u3001\u5fdc\u7528\u3068\u3057\u3066\u6b21\u306e\u3088\u3046\u306a\u3053\u3068\u304c\u3067\u304d\u307e\u3059\u3002<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">\u30b9\u30c8\u30ea\u30fc\u30e0\u30f3\u30b0\u3067Kafka\u306b\u30c7\u30fc\u30bf\u3092\u53d6\u308a\u8fbc\u307f\u3001\u4e00\u65b9\u306fSpark Streaming\u3067\u30cb\u30a2\u30ea\u30a2\u30eb\u30bf\u30a4\u30e0\u306b\u51e6\u7406<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\u3082\u3046\u4e00\u65b9\u306fHDFS\u3084HBase\u306b\u4fdd\u5b58\u3057\u3066\u30d0\u30c3\u30c1\u51e6\u7406\u3067\u5229\u7528<\/ul>\n<p>Cloudera\u306e\u30d6\u30ed\u30b0\u306b\u3001Kafka\u3092\u4f7f\u3063\u305f\u30af\u30ec\u30b8\u30c3\u30c8\u30ab\u30fc\u30c9\u306e\u4e0d\u6b63\u691c\u77e5\u30a2\u30fc\u30ad\u30c6\u30af\u30c1\u30e3\u306e\u8a2d\u8a08\u306b\u3064\u3044\u3066\u306e\u7d75\u304c\u3042\u308b\u306e\u3067\u53c2\u8003\u306b\u3057\u3066\u307f\u3066\u304f\u3060\u3055\u3044\u3002(http:\/\/blog.cloudera.com\/blog\/2015\/07\/designing-fraud-detection-architecture-that-works-like-your-brain-does\/)<\/p>\n<div><img decoding=\"async\" class=\"post-images\" title=\"\" src=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/46-0.jpeg\" alt=\"flafka5.jpg\" \/><\/div>\n<p>Kafka\u3068Spark Streaming\u3068\u306e\u9023\u643a\u306e\u8a71\u306f\u5225\u306e\u6a5f\u4f1a\u306b&#8230;<\/p>\n<h2>\u307e\u3068\u3081<\/h2>\n<p>Flume\u306fCDH\u306b\u542b\u307e\u308c\u3066\u304a\u308a\u3001Cloudera Manager\u3092\u4f7f\u3048\u3070\u7c21\u5358\u306b\u5c0e\u5165\u3001\u8a2d\u5b9a\u3067\u304d\u307e\u3059\u3002\u4ee5\u524d\u7d39\u4ecb\u3057\u305fStreamSets\u306a\u3069\u3092\u7528\u3044\u3066\u30c7\u30fc\u30bf\u30d5\u30ed\u30fc\u3092\u5b9a\u7fa9\u3059\u308b\u3053\u3068\u3082\u3067\u304d\u307e\u3059\u304c\u3001Flafka\u306e\u826f\u3044\u3068\u3053\u308d\u306f\u5916\u90e8\u30b7\u30b9\u30c6\u30e0\u3092\u4f7f\u3046\u3053\u3068\u3082\u306a\u304f\u3001\u30b7\u30f3\u30d7\u30eb(\u3067\u3059\u304c\u5f37\u529b\uff09\u306b\u69cb\u7bc9\u3067\u304d\u308b\u3068\u3053\u308d\u3067\u3059\u306d\u3002<\/p>\n<h2>\u53c2\u8003\u8cc7\u6599<\/h2>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">http:\/\/blog.cloudera.com\/blog\/2014\/11\/flafka-apache-flume-meets-apache-kafka-for-event-processing\/<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">http:\/\/blog.cloudera.com\/blog\/2016\/08\/new-in-cloudera-enterprise-5-8-flafka-improvements-for-real-time-data-ingest\/<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">\n<li style=\"list-style-type: none;\">\n<ul class=\"post-ul\">http:\/\/blog.cloudera.com\/blog\/2015\/07\/designing-fraud-detection-architecture-that-works-like-your-brain-does\/<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<ul class=\"post-ul\">https:\/\/www.linkedin.com\/pulse\/flume-kafka-real-time-event-processing-lan-jiang<\/ul>\n<div>\n<p>https:\/\/blog.cloudera.com\/blog\/2016\/08\/new-in-cloudera-enterprise-5-8-flafka-improvements-for-real-time-data-ingest\/\u00a0\u21a9<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Apache Flume\u3084Apache Kafka\u306f\u30ea\u30a2\u30eb\u30bf\u30a4\u30e0\u306a\u30a4\u30d9\u30f3\u30c8\u51e6\u7406\u306e\u30d0\u30c3\u30af\u30a8\u30f3\u30c9\u3068\u3057\u3066\u5e83\u304f\u5229\u7528\u3055\u308c [&hellip;]<\/p>\n","protected":false},"author":10,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1],"tags":[],"class_list":["post-46675","post","type-post","status-publish","format-standard","hentry","category-uncategorized"],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v21.5 (Yoast SEO v21.5) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>- Blog - Silicon Cloud<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/\" \/>\n<meta property=\"og:locale\" content=\"zh_CN\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:description\" content=\"Apache Flume\u3084Apache Kafka\u306f\u30ea\u30a2\u30eb\u30bf\u30a4\u30e0\u306a\u30a4\u30d9\u30f3\u30c8\u51e6\u7406\u306e\u30d0\u30c3\u30af\u30a8\u30f3\u30c9\u3068\u3057\u3066\u5e83\u304f\u5229\u7528\u3055\u308c [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/\" \/>\n<meta property=\"og:site_name\" content=\"Blog - Silicon Cloud\" \/>\n<meta property=\"article:published_time\" content=\"2024-02-18T10:14:54+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-05-03T15:47:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/3-0.png\" \/>\n<meta name=\"author\" content=\"\u5b87, \u534e\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"\u4f5c\u8005\" \/>\n\t<meta name=\"twitter:data1\" content=\"\u5b87, \u534e\" \/>\n\t<meta name=\"twitter:label2\" content=\"\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4\" \/>\n\t<meta name=\"twitter:data2\" content=\"3 \u5206\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/\",\"name\":\"- Blog - Silicon Cloud\",\"isPartOf\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\"},\"datePublished\":\"2024-02-18T10:14:54+00:00\",\"dateModified\":\"2024-05-03T15:47:34+00:00\",\"author\":{\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/513018e4e121d3add1b7c5de8be21458\"},\"inLanguage\":\"zh-Hans\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/\"]}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#website\",\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/\",\"name\":\"Blog - Silicon Cloud\",\"description\":\"\",\"inLanguage\":\"zh-Hans\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/513018e4e121d3add1b7c5de8be21458\",\"name\":\"\u5b87, \u534e\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/63cd45cbc05a35fc4ff7637a163c83c4962ef58d27472726c3a3e0c9c5194f0f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/63cd45cbc05a35fc4ff7637a163c83c4962ef58d27472726c3a3e0c9c5194f0f?s=96&d=mm&r=g\",\"caption\":\"\u5b87, \u534e\"},\"url\":\"https:\/\/www.silicloud.com\/zh\/blog\/author\/yuhua\/\"},{\"@type\":\"ImageObject\",\"inLanguage\":\"zh-Hans\",\"@id\":\"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/#local-main-organization-logo\",\"url\":\"\",\"contentUrl\":\"\",\"caption\":\"Blog - Silicon Cloud\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"- Blog - Silicon Cloud","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/","og_locale":"zh_CN","og_type":"article","og_description":"Apache Flume\u3084Apache Kafka\u306f\u30ea\u30a2\u30eb\u30bf\u30a4\u30e0\u306a\u30a4\u30d9\u30f3\u30c8\u51e6\u7406\u306e\u30d0\u30c3\u30af\u30a8\u30f3\u30c9\u3068\u3057\u3066\u5e83\u304f\u5229\u7528\u3055\u308c [&hellip;]","og_url":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/","og_site_name":"Blog - Silicon Cloud","article_published_time":"2024-02-18T10:14:54+00:00","article_modified_time":"2024-05-03T15:47:34+00:00","og_image":[{"url":"https:\/\/cdn.silicloud.com\/blog-img\/blog\/img\/657d660937434c4406d08f3f\/3-0.png"}],"author":"\u5b87, \u534e","twitter_card":"summary_large_image","twitter_misc":{"\u4f5c\u8005":"\u5b87, \u534e","\u9884\u8ba1\u9605\u8bfb\u65f6\u95f4":"3 \u5206"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/","url":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/","name":"- Blog - Silicon Cloud","isPartOf":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website"},"datePublished":"2024-02-18T10:14:54+00:00","dateModified":"2024-05-03T15:47:34+00:00","author":{"@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/513018e4e121d3add1b7c5de8be21458"},"inLanguage":"zh-Hans","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/"]}]},{"@type":"WebSite","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#website","url":"https:\/\/www.silicloud.com\/zh\/blog\/","name":"Blog - Silicon Cloud","description":"","inLanguage":"zh-Hans"},{"@type":"Person","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/513018e4e121d3add1b7c5de8be21458","name":"\u5b87, \u534e","image":{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/63cd45cbc05a35fc4ff7637a163c83c4962ef58d27472726c3a3e0c9c5194f0f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/63cd45cbc05a35fc4ff7637a163c83c4962ef58d27472726c3a3e0c9c5194f0f?s=96&d=mm&r=g","caption":"\u5b87, \u534e"},"url":"https:\/\/www.silicloud.com\/zh\/blog\/author\/yuhua\/"},{"@type":"ImageObject","inLanguage":"zh-Hans","@id":"https:\/\/www.silicloud.com\/zh\/blog\/46675-2\/#local-main-organization-logo","url":"","contentUrl":"","caption":"Blog - Silicon Cloud"}]}},"_links":{"self":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46675","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/users\/10"}],"replies":[{"embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/comments?post=46675"}],"version-history":[{"count":2,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46675\/revisions"}],"predecessor-version":[{"id":94833,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/posts\/46675\/revisions\/94833"}],"wp:attachment":[{"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/media?parent=46675"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/categories?post=46675"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.silicloud.com\/zh\/blog\/wp-json\/wp\/v2\/tags?post=46675"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}