记录MYSQL同步数据至ES的测试历程

相关文章推荐

开朗的硬盘 · SQL Server 容易忽略的错误 - ...· 2 年前 ·

聪明伶俐的大象 · 350. 两个数组的交集2（Python） ...· 2 年前 ·

幸福的眼镜 · iOS-Swift ...· 2 年前 ·

潇洒的吐司 · Jenkins + Docker ...· 3 年前 ·

纯真的松球 · 在Python中，我怎样才能从主脚本中同时运 ...· 3 年前 ·

腾讯云

备案控制台

开发者社区

TVP

文章/答案/技术大牛

写文章

专栏首页进击的蘑菇记录MYSQL同步数据至ES的测试历程

2 0

分享

./bin/logstash -e 'input { stdin { } } output { stdout {} }'

OpenJDK 64-Bit Server VM warning: Option UseConcMarkSweepGC was deprecated in version 9.0 and will likely be removed in a future release

 Ignoring the 'pipelines.yml' file because modules or command line options are specified

bin/logstash -f first-pipeline.conf --config.test_and_exit
bin/logstash -f first-pipeline.conf --config.reload.automatic

input {
  jdbc {
    jdbc_driver_library => "/usr/src/mysql-connector-java-8.0.23/mysql-connector-java-8.0.23.jar"
    jdbc_driver_class => "com.mysql.jdbc.Driver"
    jdbc_connection_string => "jdbc:mysql://localhost:3306/NBA"
    jdbc_user => "root"
    jdbc_password => "root"
    jdbc_paging_enabled => true
    jdbc_page_size => "50"
    tracking_column => "unix_ts_in_secs"
    use_column_value => true
    tracking_column_type => "numeric"
    schedule => "*/5 * * * * *"
    statement => "SELECT *, UNIX_TIMESTAMP(update_time) AS unix_ts_in_secs FROM nba_test WHERE (UNIX_TIMESTAMP(update_time) > :sql_last_value AND update_time < NOW()) ORDER BY update_time ASC"
filter {
  ruby { 
    code => "
      event.set('Participation_date', event.get('Participation_date').time.localtime + 8*60*60)
      event.set('Retirement_date', event.get('Retirement_date').time.localtime + 8*60*60)
  mutate {
    copy => { "number" => "[@metadata][_id]"}
    remove_field => ["unix_ts_in_secs"]
  #用mutate插件先转换为string类型,gsub只处理string类型的数据，在用正则匹配，最终得到想要的日期
  convert => {
    "Participation_date"=> "string"
    "Retirement_date"=> "string"
  gsub => ["Participation_date", "T([\S\s]*?)Z", ""] 
  gsub => ["Retirement_date", "T([\S\s]*?)Z", ""]   
output {
  stdout { codec =>  "rubydebug"}
  elasticsearch {
      index => "nba20210413"
      document_id => "%{[@metadata][_id]}"

# 如jdbc中定义的parameters 
parameters => { "target_id" => "321" }
"SELECT * FROM MYTABLE WHERE id = :target_id"

event.set(....time.localtime + 8*60*60 + 5*60+43)

#官网logstash指导文章，简直就是引路人
https://www.elastic.co/cn/blog/how-to-keep-elasticsearch-synchronized-with-a-relational-database-using-logstash
# logstash指引
https://www.elastic.co/guide/en/logstash/current/getting-started-with-logstash.html
# 关于elasticsearch的format - failed to parse date field
https://github.com/elastic/elasticsearch/issues/43966
# ruby cody Expected string
https://discuss.elastic.co/t/need-some-help-with-ruby-split/189368/2
# 安装jdk9 - UseConcMarkSweepGC was deprecated in version 9.0
https://yanglinwei.blog.csdn.net/article/details/105146395
# Timestamp field has a 5 minutes delay

文科生的python自学之路

记录MYSQL同步数据至ES的测试历程

记录MYSQL同步数据至ES的测试历程