
sps

  1. Job priority and preemption Issues

    Hello,

    It may simply be a misunderstanding about what soft limits are for. Soft limits do not cap usage; they only make jobs eligible for preemption, and even that depends on the preemption settings in sched_config. The directive

    "set server max_run_res_soft.ncpus += [g:group1=491]"

    effectively says that jobs from this group become eligible to be preempted (suspended) once the group exceeds 491 cpus.

    If you really want to limit total usage rather than affect preemption behavior, the proper limit is the same directive without "_soft":

    "set server max_run_res.ncpus += [g:group1=491]"

    That will cap group1's total cpu usage at 491.

    Moving on to the next topic, "top job" status is granted to jobs in two scenarios:

    1. They've been "starving" for too long.
    2. Strict ordering is on and the job can't run immediately.

    Starving job handling is on by default and strict ordering is not. Both are specified in sched_config.

    The strange date message is likely due to the way the scheduler predicts when a top job will eventually be able to run. It looks at all the jobs in the system and finds the ones which, when finished, will free up enough resources to allow the top job to run. It makes this prediction based on the "wall time" requested by those jobs. If a job requests no wall time, PBS uses an implicit walltime of five years. What you're likely seeing is the build-up of a few jobs without walltime specified, so PBS assumes those jobs will end in five years and predicts the top job's start accordingly.

    You may not want strict ordering or starving job handling. If you turn both of them off, you won't see these messages about top jobs and far-future dates. It will also make the scheduling cycle quicker, since calendaring top jobs is an expensive operation.

    Hope that helps. If I've misunderstood anything, let me know.

    Thanks
    Steve
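
    To make the far-future date described above more concrete, here is a rough Python sketch of that kind of prediction. It is not the PBS scheduler's actual code: the function name estimate_start, the job tuples, and the cpu counts are all made up for illustration. The only thing taken from the explanation above is the idea that a job with no requested walltime is assumed to run for five years.

```python
from datetime import datetime, timedelta

# Assumption taken from the post: a job with no requested walltime is
# treated as if it will run for five years.
IMPLICIT_WALLTIME = timedelta(days=5 * 365)


def estimate_start(now, free_cpus, needed_cpus, running_jobs):
    """Predict when `needed_cpus` will be free (hypothetical helper).

    running_jobs is a list of (ncpus, start_time, walltime_or_None) tuples.
    Returns None if the request cannot fit even after every job ends.
    """
    if free_cpus >= needed_cpus:
        return now
    # Predicted end time of every running job, earliest first.
    ends = sorted(
        (start + (walltime if walltime is not None else IMPLICIT_WALLTIME), ncpus)
        for ncpus, start, walltime in running_jobs
    )
    for end, ncpus in ends:
        free_cpus += ncpus  # this job's cpus are released at `end`
        if free_cpus >= needed_cpus:
            return max(end, now)
    return None


now = datetime(2017, 1, 1, 12, 0)
running = [
    (8, now - timedelta(hours=1), timedelta(hours=4)),  # walltime requested
    (16, now - timedelta(hours=2), None),               # no walltime requested
]

# 8 cpus are freed by the first job alone: the estimate is later the same day.
print(estimate_start(now, 0, 8, running))
# 20 cpus also need the no-walltime job to end: the estimate is years away.
print(estimate_start(now, 0, 20, running))
```

    In this toy model a single running job without a walltime is enough to push the estimate years out, which is the same effect as the odd dates in the top job messages. Requesting a walltime on every job should give the scheduler something realistic to plan with.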