Unable to query using file in Data Proc Hive Operator












1















I am unable to query with .sql file in DataProcHiveOperator.
Though the documentation tells that we can query using file. Link of the documentation Here



It is working fine when I give query directly
Here is my sample code which is working fine with writing query directly:



HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
gcp_conn_id='google_cloud_default',
query='CREATE TABLE TABLE_NAME(NAME STRING);',
cluster_name='cluster-name',
region='us-central1',
dag=dag)


Querying with file :



HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
gcp_conn_id='google_cloud_default',
query='gs://us-central1-bucket/data/sample_hql.sql',
query_uri="gs://us-central1-bucket/data/sample_hql.sql
cluster_name='cluster-name',
region='us-central1',
dag=dag)


There is no error on sample_hql.sql script.

It is reading file location as a query and throwing me the error as:



Query: 'gs://bucketpath/filename.q'
Error occuring - cannot recognize input near 'gs' ':' '/'


Similar issue has also been raised Here










share|improve this question





























    1















    I am unable to query with .sql file in DataProcHiveOperator.
    Though the documentation tells that we can query using file. Link of the documentation Here



    It is working fine when I give query directly
    Here is my sample code which is working fine with writing query directly:



    HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
    gcp_conn_id='google_cloud_default',
    query='CREATE TABLE TABLE_NAME(NAME STRING);',
    cluster_name='cluster-name',
    region='us-central1',
    dag=dag)


    Querying with file :



    HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
    gcp_conn_id='google_cloud_default',
    query='gs://us-central1-bucket/data/sample_hql.sql',
    query_uri="gs://us-central1-bucket/data/sample_hql.sql
    cluster_name='cluster-name',
    region='us-central1',
    dag=dag)


    There is no error on sample_hql.sql script.

    It is reading file location as a query and throwing me the error as:



    Query: 'gs://bucketpath/filename.q'
    Error occuring - cannot recognize input near 'gs' ':' '/'


    Similar issue has also been raised Here










    share|improve this question



























      1












      1








      1








      I am unable to query with .sql file in DataProcHiveOperator.
      Though the documentation tells that we can query using file. Link of the documentation Here



      It is working fine when I give query directly
      Here is my sample code which is working fine with writing query directly:



      HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
      gcp_conn_id='google_cloud_default',
      query='CREATE TABLE TABLE_NAME(NAME STRING);',
      cluster_name='cluster-name',
      region='us-central1',
      dag=dag)


      Querying with file :



      HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
      gcp_conn_id='google_cloud_default',
      query='gs://us-central1-bucket/data/sample_hql.sql',
      query_uri="gs://us-central1-bucket/data/sample_hql.sql
      cluster_name='cluster-name',
      region='us-central1',
      dag=dag)


      There is no error on sample_hql.sql script.

      It is reading file location as a query and throwing me the error as:



      Query: 'gs://bucketpath/filename.q'
      Error occuring - cannot recognize input near 'gs' ':' '/'


      Similar issue has also been raised Here










      share|improve this question
















      I am unable to query with .sql file in DataProcHiveOperator.
      Though the documentation tells that we can query using file. Link of the documentation Here



      It is working fine when I give query directly
      Here is my sample code which is working fine with writing query directly:



      HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
      gcp_conn_id='google_cloud_default',
      query='CREATE TABLE TABLE_NAME(NAME STRING);',
      cluster_name='cluster-name',
      region='us-central1',
      dag=dag)


      Querying with file :



      HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
      gcp_conn_id='google_cloud_default',
      query='gs://us-central1-bucket/data/sample_hql.sql',
      query_uri="gs://us-central1-bucket/data/sample_hql.sql
      cluster_name='cluster-name',
      region='us-central1',
      dag=dag)


      There is no error on sample_hql.sql script.

      It is reading file location as a query and throwing me the error as:



      Query: 'gs://bucketpath/filename.q'
      Error occuring - cannot recognize input near 'gs' ':' '/'


      Similar issue has also been raised Here







      hive google-cloud-platform airflow google-cloud-dataproc






      share|improve this question















      share|improve this question













      share|improve this question




      share|improve this question








      edited Nov 22 '18 at 21:28







      Ashish Kumar

















      asked Nov 22 '18 at 4:51









      Ashish KumarAshish Kumar

      17213




      17213
























          1 Answer
          1






          active

          oldest

          votes


















          1














          The issue is because you have passed query='gs://us-central1-bucket/data/sample_hql.sql' as well.



          You should pass exactly 1 of query or queri_uri.



          The code in your question has both of them, so remove query or use the following code:



          HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
          gcp_conn_id='google_cloud_default',
          query_uri="gs://us-central1-bucket/data/sample_hql.sql",
          cluster_name='cluster-name',
          region='us-central1',
          dag=dag)





          share|improve this answer


























          • Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

            – Ashish Kumar
            Nov 22 '18 at 21:26











          Your Answer






          StackExchange.ifUsing("editor", function () {
          StackExchange.using("externalEditor", function () {
          StackExchange.using("snippets", function () {
          StackExchange.snippets.init();
          });
          });
          }, "code-snippets");

          StackExchange.ready(function() {
          var channelOptions = {
          tags: "".split(" "),
          id: "1"
          };
          initTagRenderer("".split(" "), "".split(" "), channelOptions);

          StackExchange.using("externalEditor", function() {
          // Have to fire editor after snippets, if snippets enabled
          if (StackExchange.settings.snippets.snippetsEnabled) {
          StackExchange.using("snippets", function() {
          createEditor();
          });
          }
          else {
          createEditor();
          }
          });

          function createEditor() {
          StackExchange.prepareEditor({
          heartbeatType: 'answer',
          autoActivateHeartbeat: false,
          convertImagesToLinks: true,
          noModals: true,
          showLowRepImageUploadWarning: true,
          reputationToPostImages: 10,
          bindNavPrevention: true,
          postfix: "",
          imageUploader: {
          brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
          contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
          allowUrls: true
          },
          onDemand: true,
          discardSelector: ".discard-answer"
          ,immediatelyShowMarkdownHelp:true
          });


          }
          });














          draft saved

          draft discarded


















          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53424091%2funable-to-query-using-file-in-data-proc-hive-operator%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown

























          1 Answer
          1






          active

          oldest

          votes








          1 Answer
          1






          active

          oldest

          votes









          active

          oldest

          votes






          active

          oldest

          votes









          1














          The issue is because you have passed query='gs://us-central1-bucket/data/sample_hql.sql' as well.



          You should pass exactly 1 of query or queri_uri.



          The code in your question has both of them, so remove query or use the following code:



          HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
          gcp_conn_id='google_cloud_default',
          query_uri="gs://us-central1-bucket/data/sample_hql.sql",
          cluster_name='cluster-name',
          region='us-central1',
          dag=dag)





          share|improve this answer


























          • Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

            – Ashish Kumar
            Nov 22 '18 at 21:26
















          1














          The issue is because you have passed query='gs://us-central1-bucket/data/sample_hql.sql' as well.



          You should pass exactly 1 of query or queri_uri.



          The code in your question has both of them, so remove query or use the following code:



          HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
          gcp_conn_id='google_cloud_default',
          query_uri="gs://us-central1-bucket/data/sample_hql.sql",
          cluster_name='cluster-name',
          region='us-central1',
          dag=dag)





          share|improve this answer


























          • Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

            – Ashish Kumar
            Nov 22 '18 at 21:26














          1












          1








          1







          The issue is because you have passed query='gs://us-central1-bucket/data/sample_hql.sql' as well.



          You should pass exactly 1 of query or queri_uri.



          The code in your question has both of them, so remove query or use the following code:



          HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
          gcp_conn_id='google_cloud_default',
          query_uri="gs://us-central1-bucket/data/sample_hql.sql",
          cluster_name='cluster-name',
          region='us-central1',
          dag=dag)





          share|improve this answer















          The issue is because you have passed query='gs://us-central1-bucket/data/sample_hql.sql' as well.



          You should pass exactly 1 of query or queri_uri.



          The code in your question has both of them, so remove query or use the following code:



          HiveInsertingTable = DataProcHiveOperator(task_id='HiveInsertingTable',
          gcp_conn_id='google_cloud_default',
          query_uri="gs://us-central1-bucket/data/sample_hql.sql",
          cluster_name='cluster-name',
          region='us-central1',
          dag=dag)






          share|improve this answer














          share|improve this answer



          share|improve this answer








          edited Nov 23 '18 at 12:14

























          answered Nov 22 '18 at 15:47









          kaxilkaxil

          3,063927




          3,063927













          • Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

            – Ashish Kumar
            Nov 22 '18 at 21:26



















          • Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

            – Ashish Kumar
            Nov 22 '18 at 21:26

















          Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

          – Ashish Kumar
          Nov 22 '18 at 21:26





          Yes that's correct. And also the spelling of "query_uri" was written as "queri_uri". My bad. Thanks @kaxil

          – Ashish Kumar
          Nov 22 '18 at 21:26


















          draft saved

          draft discarded




















































          Thanks for contributing an answer to Stack Overflow!


          • Please be sure to answer the question. Provide details and share your research!

          But avoid



          • Asking for help, clarification, or responding to other answers.

          • Making statements based on opinion; back them up with references or personal experience.


          To learn more, see our tips on writing great answers.




          draft saved


          draft discarded














          StackExchange.ready(
          function () {
          StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f53424091%2funable-to-query-using-file-in-data-proc-hive-operator%23new-answer', 'question_page');
          }
          );

          Post as a guest















          Required, but never shown





















































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown

































          Required, but never shown














          Required, but never shown












          Required, but never shown







          Required, but never shown







          Popular posts from this blog

          Costa Masnaga

          Fotorealismo

          Sidney Franklin