I would like to know where the hive-site.xml file configuration is in a Cloudera distribution.
Mainly because I would like to know where I can find out properties like:
hive.execution.engine=mr
hive.enforce.bucketing=false
hive.enforce.bucketmapjoin=false
hive.enforce.sorting=false
Maybe because I could want to overwrite some of them.
I know I can overwrite them in the Hive shell, but that is only for the current session.
Or I could create a .hiverc file to initialize Hive overwriting some properties.
But I would like to know where the properties are in Cloudera distribution.
I was trying this:
$ cd /
$ sudo find -name hive-site.xml
./etc/impala/conf.dist/hive-site.xml
./etc/spark/conf.dist/hive-site.xml
./etc/hive/conf.dist/hive-site.xml
$ view /etc/impala/conf.dist/hive-site.xml
$ view /etc/spark/conf.dist/hive-site.xml
$ view /etc/hive/conf.dist/hive-site.xml
but in these files I don't see such properties
Maybe cloudera is using other properties file.
May someone help me?
Thanks in advance
Regards.
The default values for all properties are defined in the source code of Hive. The configuration files provide means to override these default values.
But if the services are installed using Cloudera Manager, then modify the configurations using the cloudera manager user interface only. CM persists the configurations in its CM database and any changes made to the configuration files will not be considered.
If the components are manually installed using Cloudera parcels and if you want to override any configuration property, use the corresponding configuration file under /etc/<component>/conf/
.
There used to be a file called hive-default.xml.template
(not sure if that file is still being shipped in the package) which had all the hive properties along with their default values. It was meant only for reference, modifying the values in this file does not take effect.