
Using flume-ng-sql-source to pull data from MySQL into Kafka for consumption by Storm

1. Download and build flume-ng-sql-source. Source: https://github.com/keedio/flume-ng-sql-source.git

   Build the jar and copy it into Flume's lib directory as described in the project's installation notes; the MySQL JDBC driver jar must be on Flume's classpath as well.

2. Write the flume-ng configuration file

```
a1.channels = ch-1
a1.sources = src-1
a1.sinks = k1

###########sql source#################
# For each one of the sources, the type is defined
a1.sources.src-1.type = org.keedio.flume.source.SQLSource
a1.sources.src-1.hibernate.connection.url = jdbc:mysql://172.16.43.21:3306/test

# Hibernate Database connection properties
a1.sources.src-1.hibernate.connection.user = hadoop
a1.sources.src-1.hibernate.connection.password = hadoop
a1.sources.src-1.hibernate.connection.autocommit = true
a1.sources.src-1.hibernate.dialect = org.hibernate.dialect.MySQL5Dialect
a1.sources.src-1.hibernate.connection.driver_class = com.mysql.jdbc.Driver
a1.sources.src-1.run.query.delay = 5000
a1.sources.src-1.status.file.path = /home/hadoop/export/server/apache-flume-1.7.0-bin
a1.sources.src-1.status.file.name = sqlSource.status

# Custom query: $@$ is replaced with the last id recorded in the status
# file, so each poll only reads rows that are new since the previous run
a1.sources.src-1.start.from = 0
a1.sources.src-1.custom.query = select `id`, `str` from json_str where id > $@$ order by id asc
a1.sources.src-1.batch.size = 1000
a1.sources.src-1.max.rows = 1000
a1.sources.src-1.hibernate.connection.provider_class = org.hibernate.connection.C3P0ConnectionProvider
a1.sources.src-1.hibernate.c3p0.min_size = 1
a1.sources.src-1.hibernate.c3p0.max_size = 10

################################################################
a1.channels.ch-1.type = memory
a1.channels.ch-1.capacity = 10000
a1.channels.ch-1.transactionCapacity = 10000
a1.channels.ch-1.byteCapacityBufferPercentage = 20
a1.channels.ch-1.byteCapacity = 800000

################################################################
a1.sinks.k1.type = org.apache.flume.sink.kafka.KafkaSink
a1.sinks.k1.topic = testuser
a1.sinks.k1.brokerList = test0:9092,test1:9092,test2:9092
a1.sinks.k1.requiredAcks = 1
a1.sinks.k1.batchSize = 20
a1.sinks.k1.channel = ch-1
a1.sources.src-1.channels = ch-1
```
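Before wiring up Storm, it is worth confirming that rows actually reach the `testuser` topic. Below is a minimal consumer sketch, assuming the kafka-clients consumer API of the 0.9/0.10 era (matching the Flume 1.7 setup above) is on the classpath; the class name and group id are invented for illustration.

```java
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

public class TestUserTopicCheck {
    public static void main(String[] args) {
        Properties props = new Properties();
        // Same broker list as a1.sinks.k1.brokerList above
        props.put("bootstrap.servers", "test0:9092,test1:9092,test2:9092");
        props.put("group.id", "flume-sql-check");   // hypothetical group id
        props.put("auto.offset.reset", "earliest"); // read the topic from the start
        props.put("key.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
                "org.apache.kafka.common.serialization.StringDeserializer");

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(Collections.singletonList("testuser"));
            while (true) {
                // poll(long) is the old-style API; newer clients take a Duration
                ConsumerRecords<String, String> records = consumer.poll(1000L);
                for (ConsumerRecord<String, String> record : records) {
                    System.out.printf("offset=%d value=%s%n",
                            record.offset(), record.value());
                }
            }
        }
    }
}
```

Each printed value should be one row from json_str; this is also where the extra double quotes described below first become visible.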

3. Problems encountered

After the MySQL rows are collected into Kafka, the messages contain many extra double quotes. flume-ng-sql-source writes each row out as a CSV record, so every field is wrapped in double quotes and any quote character inside the data is doubled; a JSON value such as {"name":"x"} therefore shows up in Kafka as "{""name"":""x""}".

MySQL data format: (screenshot in the original post: rows with an `id` column and a `str` column holding a JSON string)

Kafka data format: (screenshot in the original post: the same rows as fully quoted CSV lines)
Use Storm to clean up the format of the data read from Kafka; a sketch of such a bolt follows.
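Below is a minimal sketch of that cleanup step, not the original author's topology: a Storm 1.x BaseBasicBolt (the class name UnquoteBolt is hypothetical) that assumes each Kafka message is a two-column CSV row ("id","str") as produced above, and that the JSON payload never contains a literal "," sequence.

```java
import org.apache.storm.topology.BasicOutputCollector;
import org.apache.storm.topology.OutputFieldsDeclarer;
import org.apache.storm.topology.base.BaseBasicBolt;
import org.apache.storm.tuple.Fields;
import org.apache.storm.tuple.Tuple;
import org.apache.storm.tuple.Values;

/**
 * Strips the CSV-style quoting added by flume-ng-sql-source so that
 * downstream bolts see the raw JSON string from the `str` column.
 * Input example:  "1","{""name"":""x""}"
 * Output example: id=1, str={"name":"x"}
 */
public class UnquoteBolt extends BaseBasicBolt {

    @Override
    public void execute(Tuple input, BasicOutputCollector collector) {
        String line = input.getString(0);      // one Kafka message per tuple
        // Split once on the separator between the two quoted CSV fields
        String[] parts = line.split("\",\"", 2);
        if (parts.length != 2) {
            return;                            // not the expected two-column row
        }
        String id = strip(parts[0]);
        String str = strip(parts[1]);
        collector.emit(new Values(id, str));
    }

    /** Remove the surrounding quotes and collapse doubled inner quotes. */
    private static String strip(String field) {
        if (field.startsWith("\"")) field = field.substring(1);
        if (field.endsWith("\""))   field = field.substring(0, field.length() - 1);
        return field.replace("\"\"", "\"");
    }

    @Override
    public void declareOutputFields(OutputFieldsDeclarer declarer) {
        declarer.declare(new Fields("id", "str"));
    }
}
```

Wired after a Kafka spout, downstream bolts then receive the raw JSON string in the str field and can parse it normally.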