elasticsearchelasticsearch-jdbc-riverelasticsearch-plugin

Trying to use jdbc river ElasticSearch plugin for batch processing


I need to write some indexing jobs to run once per day that query our Oracle database tables and index into ElasticSearch. Some tables index first and others next since there are table dependencies. But around that process of indexing, I need to enhance the fields going into to the ES index, as well as log to our Oracle database table job statuses and perhaps even the records that have succeeded/failed the indexing process.

Can I use the Elastic Search plugin JDBC-River.


Solution

  • My concern was logging back to the RDBS via an insert statement after the query to extract from the DB. I got in touch with the creator of the jdbc-river. He mentioned this is how I should configure things: really helpful!

    curl -XDELETE '0:9200/_river/my_jdbc_river/
    
    
    curl -XPUT '0:9200/_river/my_jdbc_river/_meta' -d '
        {
            "type": "jdbc",
            "jdbc": {
                "url": "jdbc:mysql://localhost:3306/test",
                "user": "",
                "password": "",
                "schedule": "0 0-59 0-23 ? * *",
                "sql": [
                    {
                        "statement": "select *, created as _id, \"myjdbc\" as _index, \"mytype\" as _type from orders"
                    },
                    {
                        "statement": "insert into ack(n,t,c) values(?,?,?)",
                        "parameter": [
                            "$job",
                            "$now",
                            "$count"
                        ]
                    }
                ]
            }
        }'