Uploaded image for project: 'IMPALA'
  1. IMPALA
  2. IMPALA-2468

5.5.x full dataload crashed while querying lineitem_multiblock

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Resolved
    • Blocker
    • Resolution: Fixed
    • Impala 2.3.0
    • Impala 2.3.0
    • None
    • None

    Description

      The full dataload on 5.5.x looks like it crashed last night while executing queries against lineitem_multiblock.

      04:04:53.434 -- executing against localhost:21000
      04:04:53.434 SELECT count(*) from lineitem_multiblock;
      04:04:53.434 
      04:04:53.434 -- executing against localhost:21000
      04:04:53.434 SELECT count(l_comment), min(l_partkey), max(l_linenumber) from lineitem_multiblock;
      04:04:53.437 
      04:04:53.437  TestTpchMemLimitError.test_low_mem_limit_q20[mem_limit: 180 | exec_option: {'disable_codegen': False, 'abort_on_error': 1, 'exec_single_node_rows_threshold': 0, 'batch_size': 0, 'num_nodes': 0} | table_format: parquet/none] 
      04:04:53.438 [gw2] linux2 -- Python 2.6.6 /data/2/jenkins/workspace/impala-cdh5.5.x-nightly-data-load/repos/Impala/bin/../infra/python/env/bin/python
      04:04:53.439 query_test/test_mem_usage_scaling.py:184: in test_low_mem_limit_q20
      04:04:53.439     self.low_memory_limit_test(vector, 'tpch-q20', self.MIN_MEM_FOR_TPCH['Q20']);
      04:04:53.439 query_test/test_mem_usage_scaling.py:96: in low_memory_limit_test
      04:04:53.439     assert TestLowMemoryLimits.EXPECTED_ERROR_MSG in str(e)
      04:04:53.439 E   assert 'Memory limit exceeded' in "ImpalaBeeswaxException:\n INNER EXCEPTION: <class 'socket.error'>\n MESSAGE: [Errno 104] Connection reset by peer"
      04:04:53.439 E    +  where 'Memory limit exceeded' = TestLowMemoryLimits.EXPECTED_ERROR_MSG
      04:04:53.439 E    +  and   "ImpalaBeeswaxException:\n INNER EXCEPTION: <class 'socket.error'>\n MESSAGE: [Errno 104] Connection reset by peer" = str(ImpalaBeeswaxException())
      04:04:53.439 ----------------------------- Captured stderr call -----------------------------
      

      There was also a failure like this:
      http://sandbox.jenkins.cloudera.com/view/Impala/view/Builds%20-%202.3.0%20Release/job/impala-cdh5.5.x-nightly-data-load/32/testReport/junit/query_test.test_scanners/TestParquet/test_parquet_exec_option____disable_codegen___False___abort_on_error___1___exec_single_node_rows_threshold___0___batch_size___0___num_nodes___0____table_format__parquet_none_/

      E   ImpalaBeeswaxException: ImpalaBeeswaxException:
      E    Query aborted:
      E   Column chunk should not contain two dictionary pages.
      E   
      E   
      E   
      E   Column chunk should not contain two dictionary pages.
      

      sailesh, please take a look and see how quickly you think the fix will be. If it will take a while to resolve, we may need to back out the parquet multiblock change.

      Attachments

        Activity

          People

            tarmstrong Tim Armstrong
            dhecht Daniel Hecht
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: