How to insert data into a table in Hive?

1 year ago

Jackson Davis

Hive is a data warehousing tool for querying and analyzing data stored on Hadoop. Two common ways to insert data into a Hive table are the INSERT INTO ... VALUES statement and the LOAD DATA statement.

You can insert data into a table using the INSERT INTO statement in Hive, with the following syntax:

INSERT INTO TABLE table_name [PARTITION (partition_col = partition_col_value, ...)] VALUES (value1, value2, ...);

Here, table_name is the table that receives the data, the optional PARTITION clause names the target partition, and the VALUES clause lists the values for one row in the same order as the table's columns. Several rows can be inserted in one statement by separating the value lists with commas.
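As a minimal sketch of the INSERT INTO syntax, the example below creates a small partitioned table and inserts two rows into one partition. The table name employees, its columns, and the sample values are hypothetical and chosen only for illustration; INSERT ... VALUES requires Hive 0.14 or later.

```sql
-- Hypothetical table used only for illustration.
CREATE TABLE IF NOT EXISTS employees (
  id     INT,
  name   STRING,
  salary DOUBLE
)
PARTITIONED BY (dept STRING)
STORED AS ORC;

-- Insert two rows into the dept = 'sales' partition.
-- The partition column is set in the PARTITION clause, so it is not repeated in VALUES.
INSERT INTO TABLE employees PARTITION (dept = 'sales')
VALUES (1, 'Alice', 52000.0),
       (2, 'Bob',   48000.0);

-- Read the partition back to check the inserted rows.
SELECT id, name, salary FROM employees WHERE dept = 'sales';
```

Each INSERT statement typically runs as its own job and writes at least one new file, so this form is better suited to small amounts of data than to bulk loads.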

With the Hive LOAD DATA statement, you can load data into a table using the following syntax:

LOAD DATA [LOCAL] INPATH 'input_path' [OVERWRITE] INTO TABLE table_name [PARTITION (partition_col = partition_col_value, ...)];

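Here, input_path is the path of the file or directory to load. With the LOCAL keyword, the path refers to the local file system and the file is copied into the table's location; without it, the path refers to HDFS (or another Hadoop-supported file system) and the file is moved. The OVERWRITE keyword replaces the existing data in the table or partition instead of adding to it. table_name is the table to load data into, and the optional PARTITION clause selects the target partition. LOAD DATA does not transform the file, so its format must match the table's storage format.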
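A short sketch of LOAD DATA in practice follows. The table page_views, its columns, and the file path /tmp/page_views_2024-01-01.csv are assumptions made up for this example; the table is declared as comma-delimited text so that a plain CSV file without a header row matches its storage format.

```sql
-- Hypothetical table whose storage format matches a comma-separated text file.
CREATE TABLE IF NOT EXISTS page_views (
  user_id INT,
  url     STRING
)
PARTITIONED BY (view_date STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
STORED AS TEXTFILE;

-- Copy a file from the local file system into one partition,
-- replacing whatever that partition already contains.
-- The path below is an assumed example location.
LOAD DATA LOCAL INPATH '/tmp/page_views_2024-01-01.csv'
OVERWRITE INTO TABLE page_views
PARTITION (view_date = '2024-01-01');
```

Dropping LOCAL would make Hive look for the path on HDFS and move the file rather than copy it, and dropping OVERWRITE would add the file to the partition's existing data.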

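These are two methods for inserting data into a Hive table. INSERT INTO ... VALUES suits small numbers of rows written directly in the query, while LOAD DATA suits data that already exists as files. Choose the method that fits your needs and circumstances.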