Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-4004

impala-asf-master-core-non-partitioned-joins-and-aggs failing due to select query querying nested types

    XMLWordPrintableJSON

Details

    Description

      http://sandbox.jenkins.cloudera.com/job/impala-asf-master-core-non-partitioned-joins-and-aggs/40/

      kwho - I assigned this to you as the git history looks like you added tpch_nested_parquet to the query and I thought you might have an idea what’s going on here; feel free to find another person or assign back to me if you're swamped.

      Here's the relevant error message:

      03:30:51.213 =================================== FAILURES ===================================
      03:30:51.213  TestFailpoints.test_failpoints[target_node: ('UNNEST [c.c_orders.o_lineitems', [4]) | table_format: text/none | exec_option: {'disable_codegen': False, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0, 'batch_size': 0, 'num_nodes': 0} | query_type: SELECT | location: GETNEXT | action: CANCEL] 
      03:30:51.213 [gw1] linux2 -- Python 2.6.6 /data/jenkins/workspace/impala-umbrella-build-and-test/repos/Impala/bin/../infra/python/env/bin/python
      03:30:51.213 failure/test_failpoints.py:138: in test_failpoints
      03:30:51.213     self.execute_query(query, vector.get_value('exec_option'))
      03:30:51.213 common/impala_test_suite.py:407: in wrapper
      03:30:51.213     return function(*args, **kwargs)
      03:30:51.213 common/impala_test_suite.py:432: in execute_query
      03:30:51.213     return self.__execute_query(self.client, query, query_options)
      03:30:51.213 common/impala_test_suite.py:499: in __execute_query
      03:30:51.213     return impalad_client.execute(query, user=user)
      03:30:51.213 common/impala_connection.py:160: in execute
      03:30:51.213     return self.__beeswax_client.execute(sql_stmt, user=user)
      03:30:51.213 beeswax/impala_beeswax.py:173: in execute
      03:30:51.213     handle = self.__execute_query(query_string.strip(), user=user)
      03:30:51.213 beeswax/impala_beeswax.py:339: in __execute_query
      03:30:51.213     self.wait_for_completion(handle)
      03:30:51.213 beeswax/impala_beeswax.py:359: in wait_for_completion
      03:30:51.213     raise ImpalaBeeswaxException("Query aborted:" + error_log, None)
      03:30:51.213 E   ImpalaBeeswaxException: ImpalaBeeswaxException:
      03:30:51.213 E    Query aborted:
      03:30:51.213 E   Query referencing nested types is not supported because the --enable_partitioned_hash_join and/or --enable_partitioned_aggregation Impala Daemon start-up flags are set to false.
      03:30:51.213 E   To enable nested types support please set those flags to true (they are enabled by default).
      03:30:51.213 ---------------------------- Captured stderr setup -----------------------------
      03:30:51.213 -- connecting to: localhost:21000
      03:30:51.213 MainThread: Found credentials in environment variables.
      03:30:51.213 ----------------------------- Captured stderr call -----------------------------
      03:30:51.213 MainThread: Current dubug action: SET DEBUG_ACTION=4:GETNEXT:WAIT
      03:30:51.213 MainThread: Starting async query execution
      03:30:51.213 -- executing against localhost:21000
      03:30:51.213 use functional;
      03:30:51.213 
      03:30:51.213 SET batch_size=0;
      03:30:51.213 SET num_nodes=0;
      03:30:51.213 SET disable_codegen=False;
      03:30:51.213 SET abort_on_error=1;
      03:30:51.213 SET debug_action=4:GETNEXT:WAIT;
      03:30:51.213 SET exec_single_node_rows_threshold=0;
      03:30:51.213 -- executing async: localhost:21000
      03:30:51.213 
      03:30:51.213 select a.int_col, count(b.int_col) int_sum, count(l.l_shipdate)
      03:30:51.213 from functional_hbase.alltypesagg a, tpch_nested_parquet.customer c, c.c_orders.o_lineitems l
      03:30:51.213 join
      03:30:51.213   (select * from alltypes
      03:30:51.213    where year=2009 and month=1 order by int_col limit 2500
      03:30:51.213    union all
      03:30:51.213    select * from alltypes
      03:30:51.213    where year=2009 and month=2 limit 3000) b
      03:30:51.213 on (a.int_col = b.int_col) and (a.int_col = c.c_custkey)
      03:30:51.213 where c.c_mktsegment = 'BUILDING'
      03:30:51.213 group by a.int_col
      03:30:51.213 order by int_sum
      03:30:51.213 ;
      03:30:51.213 
      03:30:51.213 MainThread: Sleeping
      03:30:51.213 -- canceling operation: <tests.common.impala_connection.OperationHandle object at 0x65c5490>
      03:30:51.213 -- closing query for operation handle: <tests.common.impala_connection.OperationHandle object at 0x65c5490>
      03:30:51.213 SET batch_size=0;
      03:30:51.213 SET num_nodes=0;
      03:30:51.213 SET disable_codegen=False;
      03:30:51.213 SET abort_on_error=1;
      03:30:51.213 SET exec_single_node_rows_threshold=0;
      03:30:51.213 -- executing against localhost:21000
      03:30:51.213 
      03:30:51.213 select a.int_col, count(b.int_col) int_sum, count(l.l_shipdate)
      03:30:51.213 from functional_hbase.alltypesagg a, tpch_nested_parquet.customer c, c.c_orders.o_lineitems l
      03:30:51.213 join
      03:30:51.213   (select * from alltypes
      03:30:51.213    where year=2009 and month=1 order by int_col limit 2500
      03:30:51.213    union all
      03:30:51.213    select * from alltypes
      03:30:51.213    where year=2009 and month=2 limit 3000) b
      03:30:51.213 on (a.int_col = b.int_col) and (a.int_col = c.c_custkey)
      03:30:51.213 where c.c_mktsegment = 'BUILDING'
      03:30:51.213 group by a.int_col
      03:30:51.213 order by int_sum
      03:30:51.213 ;
      

      Attachments

        Activity

          People

            kwho Michael Ho
            lv Lars Volker
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: