Details
Type: Improvement
Status: To Do
Priority: Major
Resolution: Unresolved
Description
As an owner of a Turing, Volta, Pascal, or Jetson TX1 device, I would like to run inference using TensorRT with FP16 DType tensors, to take advantage of the dedicated half-precision hardware optimizations these devices provide.
References:
https://devblogs.nvidia.com/tensor-core-ai-performance-milestones/
https://devblogs.nvidia.com/programming-tensor-cores-cuda-9/