+ Reply to Thread
Results 1 to 3 of 3

Thread: SymphonyDE cluster error

  1. #1
    jmulerik is offline Junior Member
    Join Date
    March 28th, 2008
    Location
    Canberra
    Posts
    14
    Downloads
    0
    Uploads
    0

    Default SymphonyDE cluster error

    While I was testing Symphony CG Solver Application in our linux cluster (Centos 4.5), I got the following error, when the order of the Matrix gone beyond 1000 +

    exception caught ... Domain <Application>: Cannot operate on session : BindServiceFailed: RETRYCOUNTEXCEEDED - session has already retried 5 times, and the rerun limit is 5 times.

    I am using up to 1000 tasks to ensure proper scaling of tasks between the cluster nodes.

    What this exception means? Any clue? (btw, I am reusing the same session through out the Application)

    Where is the documentation regarding these Exceptions?
    Last edited by jmulerik; October 1st, 2008 at 02:23 AM.
    Jaison Mulerikkal
    DCS, ANU, Canberra
    Australia

    http://cs.anu.edu.au/~Jaison.Mulerikkal

  2. #2
    Ajith's Avatar
    Ajith is offline Symphony DE Moderator
    Join Date
    February 28th, 2008
    Location
    Markham, Ontario
    Posts
    108
    Blog Entries
    2
    Downloads
    10
    Uploads
    0

    Default

    Hi Jaison,

    BindService is the same as OnSessionEnter in your service application. The error indicates that for your session, OnSessionEnter returned a FailureException 5 times to the SIMs in your cluster. The SSM log collects information on FailureExceptions so check there. You can configure retry count in the application profile here: sessionRetryLimit="3".

    - Ajith

    Quote Originally Posted by jmulerik View Post
    While I was testing Symphony CG Solver Application in our linux cluster (Centos 4.5), I got the following error, when the order of the Matrix gone beyond 1000 +

    exception caught ... Domain <Application>: Cannot operate on session : BindServiceFailed: RETRYCOUNTEXCEEDED - session has already retried 5 times, and the rerun limit is 5 times.

    I am using up to 1000 tasks to ensure proper scaling of tasks between the cluster nodes.

    What this exception means? Any clue? (btw, I am reusing the same session through out the Application)

    Where is the documentation regarding these Exceptions?

  3. #3
    jmulerik is offline Junior Member
    Join Date
    March 28th, 2008
    Location
    Canberra
    Posts
    14
    Downloads
    0
    Uploads
    0

    Default

    Thanks Ajith.. I will try that
    Jaison Mulerikkal
    DCS, ANU, Canberra
    Australia

    http://cs.anu.edu.au/~Jaison.Mulerikkal

+ Reply to Thread

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts