Mesos / MESOS-8392

Framework disconnected

Details

    • Type: Bug
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 1.0.1
    • Fix Version/s: None
    • Component/s: framework, master
    • Labels: None
    • Environment: MESOS & DCOS

    Description

      Hi,

      My driver application runs in a container as a Metronome job and issues a Spark SQL query. The execution fails intermittently (not on every run). When it fails, the Mesos GUI shows all 7 tasks of the framework as KILLED under the framework's completed tasks.

      On the master server, "journalctl -u dcos-mesos-master -b | less" shows:
      Jan 04 10:48:56 versailles-bcmt-master-2.ca4mn.com mesos-master[11092]: I0104 10:48:56.281100 11107 master.cpp:1284] Framework fc494a1f-479d-42f2-a2b8-350d383f86bd-1119 (Summarization) at scheduler-08001282-f999-4af3-a7ad-7de2f7da222c@10.75.219.13:45282 disconnected

      The attached traces.log.gz contains the output of:
      journalctl -u dcos-mesos-master -b | grep "Jan 04 10" > traces.log
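To make the disconnect events easier to spot in a large trace, the log can be scanned with a small script. The following is a minimal sketch, not part of the original report: the function name `find_disconnects` and the regular expression are illustrative assumptions, and the pattern is modelled only on the single master.cpp line quoted above, so other Mesos versions may need a different pattern.

```python
import re

# Assumption: the pattern mirrors the one "disconnected" line quoted in this
# report; it is not taken from the Mesos source and may not match other versions.
DISCONNECT_RE = re.compile(
    r"Framework (?P<framework_id>\S+) \((?P<name>[^)]*)\) "
    r"at (?P<scheduler>\S+) disconnected"
)


def find_disconnects(lines):
    """Return one dict per 'Framework ... disconnected' log line."""
    events = []
    for line in lines:
        match = DISCONNECT_RE.search(line)
        if match:
            events.append(match.groupdict())
    return events


# The log line from this report, used as sample input.
sample = (
    "Jan 04 10:48:56 versailles-bcmt-master-2.ca4mn.com mesos-master[11092]: "
    "I0104 10:48:56.281100 11107 master.cpp:1284] Framework "
    "fc494a1f-479d-42f2-a2b8-350d383f86bd-1119 (Summarization) at "
    "scheduler-08001282-f999-4af3-a7ad-7de2f7da222c@10.75.219.13:45282 disconnected"
)

events = find_disconnects([sample])
for event in events:
    print(event["framework_id"], event["name"], event["scheduler"])
```

To run it against the attachment, the lines could be read with `gzip.open("traces.log.gz", "rt")` and passed to `find_disconnects` in place of the sample list.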

      Attachments

        1. Framework_tasks_killed.PNG
          44 kB
          LANDAIS Christophe
        2. mesos_failed_task.PNG
          113 kB
          LANDAIS Christophe
        3. traces.log.gz
          763 kB
          LANDAIS Christophe

          People

            Assignee: Unassigned
            Reporter: LANDAIS Christophe
            Votes: 0
            Watchers: 1