[HIVE-7926] long-lived daemons for query fragment execution, I/O and caching - ASF JIRA

Details

Type: New Feature
Status: Closed
Priority: Major
Resolution: Fixed
Affects Version/s: None
Fix Version/s: 2.0.0
Component/s: None
Labels: TODOC2.0
Target Version/s: 2.0.0
Release Note:

LLAP is the new hybrid execution model that enables efficiencies across queries, such as caching of columnar data, JIT-friendly operator pipelines, and reduced overhead for multiple queries (including concurrent queries), as well as new performance features like asynchronous I/O, pre-fetching and multi-threaded processing. The hybrid model consists of a long-lived service interacting with on-demand elastic containers serving as a tightly integrated DAG-based framework for query execution.

The first version of LLAP ships in the Hive 2.0 release. The component has been exercised extensively on test and live clusters, but is expected to have rough edges in this initial release.
Current limitations: LLAP is supported with Tez only; it does not support ACID tables; the I/O elevator and cache support only the ORC format and vectorized execution.
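The limitations listed in this release note can be read as a simple eligibility check. The following is a minimal Python sketch, not Hive code: `QueryContext`, `llap_blockers` and `io_cache_blockers` are hypothetical names introduced here for illustration, and the fields are assumptions about what a caller would know about a query.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QueryContext:
    """Hypothetical summary of a query; not a Hive class."""
    engine: str        # e.g. "tez" or "mr"
    acid_table: bool   # True if the query touches an ACID table
    file_format: str   # e.g. "orc"
    vectorized: bool   # True if vectorized execution is enabled


def llap_blockers(ctx):
    """Return the reasons LLAP as a whole cannot be used for this query."""
    blockers = []
    if ctx.engine.lower() != "tez":
        blockers.append("LLAP is supported with Tez only")
    if ctx.acid_table:
        blockers.append("ACID tables are not supported")
    return blockers


def io_cache_blockers(ctx):
    """Return the reasons the I/O elevator and cache cannot be used.

    This includes the general LLAP blockers plus the two format-specific ones.
    """
    blockers = llap_blockers(ctx)
    if ctx.file_format.lower() != "orc":
        blockers.append("I/O elevator and cache support ORC only")
    if not ctx.vectorized:
        blockers.append("I/O elevator and cache require vectorized execution")
    return blockers
```

An empty list from either function means none of the limitations stated in the release note applies to that query.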


Description

We are proposing a new execution model for Hive that combines the existing process-based tasks with long-lived daemons running on worker nodes. The daemons can take care of efficient I/O, caching and query fragment execution, while heavy lifting such as most joins, ordering, etc. can be handled by tasks.
The proposed model is not a two-system solution for small and large queries, nor is it a separate execution engine like MR or Tez. It can be used by any Hive execution engine if support is added; in the future, even external products (e.g. Pig) could use it.
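The split described above, with daemons handling I/O, caching and fragment execution while tasks do the heavy lifting, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the LLAP design: the fragment kinds, the `place` function, the `Daemon` class and the LRU eviction policy are all hypothetical and are not taken from the issue or the design document.

```python
from collections import OrderedDict

# Assumption: fragment kinds a long-lived daemon would run.
# Anything else (joins, ordering, ...) is left to process-based tasks.
DAEMON_KINDS = {"scan", "filter", "project"}


def place(fragment_kind):
    """Decide where a fragment runs: "daemon" or "task" (hypothetical rule)."""
    return "daemon" if fragment_kind in DAEMON_KINDS else "task"


class Daemon:
    """Toy long-lived daemon that caches column chunks across queries."""

    def __init__(self, capacity=2):
        self.capacity = capacity
        self.cache = OrderedDict()  # chunk id -> data, oldest entry first
        self.disk_reads = 0         # number of cache misses served from storage

    def read(self, chunk_id):
        if chunk_id in self.cache:
            # Cache hit: mark as most recently used and skip the storage read.
            self.cache.move_to_end(chunk_id)
            return self.cache[chunk_id]
        # Cache miss: simulate reading from storage, then cache the result.
        self.disk_reads += 1
        data = f"data({chunk_id})"
        self.cache[chunk_id] = data
        if len(self.cache) > self.capacity:
            # Evict the least recently used chunk (LRU is an assumption here).
            self.cache.popitem(last=False)
        return data
```

Because the daemon outlives any single query, a second query reading the same chunk is served from the cache instead of storage, which is the cross-query efficiency the proposal is after.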

The high-level design document for this proposal will be attached shortly.

Attachments

LLAPdesigndocument.pdf	01/Sep/14 05:25	242 kB	Sergey Shelukhin

Sub-Tasks

1.	LLAP: separate decoding thread from read/uncompress thread	Open	Sergey Shelukhin
2.	LLAP: implement pause and stop for async data production	Open	Unassigned
3.	LLAP: row-level vectorized SARGs	Patch Available	Yohei Abe
4.	LLAP: (IO) consider using reactive framework to string together parts of code	Open	Unassigned
5.	LLAP: ORC decoding of row groups for complex types	Open	Prasanth Jayachandran
6.	LLAP: consider specialized "transient" metadata cache	Open	Unassigned
7.	documentation for llap	Open	Gunther Hagleitner
8.	LLAP: Add rack based scheduling of work	Open	Unassigned
9.	fix yarn service registry not found in ut problem	Open	Gunther Hagleitner
10.	LLAP : add LLAP IO read debug tool	Open	Sergey Shelukhin
11.	LLAP: adjust allocation after decompression	Open	Unassigned
12.	LLAP: enable yourkit profiling of tasks	Patch Available	Sergey Shelukhin
13.	LLAP: Use task number, attempt number to cache plans	Open	Unassigned
14.	LLAP: Tez heartbeats are delayed by ~500+ ms due to Hadoop IPC client	Open	Siddharth Seth
15.	LLAP: general cache deadlock avoidance	Open	Sergey Shelukhin
16.	Move fragment execution onto a thread pool	Open	Unassigned
17.	LLAP: Make use of additional information to determine run/preemption order	Open	Unassigned
18.	LLAP: fix container sizing configuration for memory	Open	Vikram Dixit K
19.	LLAP: allocator occasionally has a spurious failure to allocate due to "partitioned" locking and has to retry	Open	Sergey Shelukhin
20.	LLAP: Add counters for time lost per query due to preemption	Open	Unassigned
21.	LLAP: task scheduler thread-count keeps growing	Open	Siddharth Seth
22.	LLAP: Exception reported while trying to kill a task	Open	Prasanth Jayachandran
23.	LLAP: Better handling of hostnames when sending heartbeats to the AM	Open	Unassigned
24.	LLAP: Serialize handling of requests / events for a query within daemons	Open	Siddharth Seth
25.	LLAP: make ObjectCache for plans work properly in the daemon	Patch Available	Sergey Shelukhin
26.	Create LLAP Monitor Daemon class and launch scripts	Open	Yuya OZAWA

Activity

People

Assignee: Sergey Shelukhin

Reporter: Sergey Shelukhin

Votes: 3

Watchers: 47

Dates

Created: 01/Sep/14 05:23

Updated: 07/Oct/16 16:53

Resolved: 18/Nov/15 20:25