Hive inserting parquet files with different schema
up vote
0
down vote
favorite
I am trying to insert 2 parquet files to HDFS via hive.
the first one looks like
----------------
|col1|col2|col3|
----------------
|1 |2 |3 |
----------------
when I select in HUE, I see this row correctly.
Then I insert the second file to the same table that looks like this:
---------------
col3|col4|col5|
---------------
31 |4 |5 |
---------------
The insert query works here but when I do a select query from all the rows, I get this output:
-------------------------
col1|col2|col3|col4|col5|
-------------------------
1 |2 |3 |null|null|
------------------------
NULL|NULL|31 |NULL|NULL|
------------------------
The second row should be null, null, 31,4,5
what am I doing wrong here?
hadoop hive
add a comment |
up vote
0
down vote
favorite
I am trying to insert 2 parquet files to HDFS via hive.
the first one looks like
----------------
|col1|col2|col3|
----------------
|1 |2 |3 |
----------------
when I select in HUE, I see this row correctly.
Then I insert the second file to the same table that looks like this:
---------------
col3|col4|col5|
---------------
31 |4 |5 |
---------------
The insert query works here but when I do a select query from all the rows, I get this output:
-------------------------
col1|col2|col3|col4|col5|
-------------------------
1 |2 |3 |null|null|
------------------------
NULL|NULL|31 |NULL|NULL|
------------------------
The second row should be null, null, 31,4,5
what am I doing wrong here?
hadoop hive
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43
add a comment |
up vote
0
down vote
favorite
up vote
0
down vote
favorite
I am trying to insert 2 parquet files to HDFS via hive.
the first one looks like
----------------
|col1|col2|col3|
----------------
|1 |2 |3 |
----------------
when I select in HUE, I see this row correctly.
Then I insert the second file to the same table that looks like this:
---------------
col3|col4|col5|
---------------
31 |4 |5 |
---------------
The insert query works here but when I do a select query from all the rows, I get this output:
-------------------------
col1|col2|col3|col4|col5|
-------------------------
1 |2 |3 |null|null|
------------------------
NULL|NULL|31 |NULL|NULL|
------------------------
The second row should be null, null, 31,4,5
what am I doing wrong here?
hadoop hive
I am trying to insert 2 parquet files to HDFS via hive.
the first one looks like
----------------
|col1|col2|col3|
----------------
|1 |2 |3 |
----------------
when I select in HUE, I see this row correctly.
Then I insert the second file to the same table that looks like this:
---------------
col3|col4|col5|
---------------
31 |4 |5 |
---------------
The insert query works here but when I do a select query from all the rows, I get this output:
-------------------------
col1|col2|col3|col4|col5|
-------------------------
1 |2 |3 |null|null|
------------------------
NULL|NULL|31 |NULL|NULL|
------------------------
The second row should be null, null, 31,4,5
what am I doing wrong here?
hadoop hive
hadoop hive
edited Nov 19 at 17:41
mustaccio
14k83637
14k83637
asked Nov 19 at 10:20
user1997656
163313
163313
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43
add a comment |
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43
add a comment |
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
active
oldest
votes
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Some of your past answers have not been well-received, and you're in danger of being blocked from answering.
Please pay close attention to the following guidance:
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53372507%2fhive-inserting-parquet-files-with-different-schema%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Could you please post your code so that people here can help you out?
– VIN
Nov 19 at 16:43