Uploaded image for project: 'Singa'
  1. Singa
  2. SINGA-134

Extend SINGA to run over a GPU cluster

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Resolved
    • Major
    • Resolution: Fixed
    • None

    Description

      Currently SINGA is able to run over a cluster of nodes using CPU and over a single node with multiple GPUs.
      This ticket is going to extend SINGA to run over a GPU cluster.
      The framework is applicable for such training environment.
      We need to update the code for allocating the GPU workers on different nodes and for messaging passing between GPUs on different nodes (refer to SINGA-133).

      Attachments

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            wangwei.cs wangwei
            wangwei.cs wangwei
            Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment