Download User`s Guide
Transcript
Appendix B. TORQUE Release Information 4.1.4 e - When in cray mode, write physmem and availmem in addition to totmem so that Moab correctly reads memory info. e - Specifying size, nodes, and mppwidth and all mutually exclusize, so reject job submissions that attempt to specify more than one of these. TRQ-1185. b - Merged changes for revision 7000 by hand because the merge was not clean. fixes problems with a deadlock when doing job dependencies using synccount/syncwith. TRQ-1374 b - Fix a segfault in req_jobobit due to an off-by-one error. TRQ-1361. e - Add the svn revision to --version outputs. TRQ-1357. b - Fix a race condition in mom hierarchy reporting. TRQ-1378. b - Fixed pbs_mom so epilogue will only run once. TRQ-1134 b - Fix some debug output escaping into job output. TRQ-1360. b - Fixed a problem where server threads all get stuck in a poll. The problem was an infinite loop created in socket_wait_for_read if poll return -1. TRQ-1382 b - Fix a Cray-mode bug with jobs ending immediately when spanning nodes of different proc counts when specifying -l procs. TRQ-1365. b - Don’t fail to make the tmpdir for sister moms. bugzilla #220, TRQ-1403. c - Fix crashes due to unprotected array accesses. TRQ-1395. b - Fixed a deadlock in get_parent_dest_queues when the queue_parent_name and queue_dest_name are the same. TRQ-1413. 11/7/12 b - Fixed segfault in req_movejob where the job ji_qhdr was NULL. TRQ-1416 b - Fix a conflict in the code for herogeneous jobs and regular jobs. b - For alps jobs, use the login nodes evenly even when one goes down. TRQ-1317. b - Display the correct ’Assigned Cpu Count’ in momctl output. TRQ-1307. b - Make pbs_original_connect() no longer hang if the host is down. TRQ-1388. b - Make epilogues run only once and be executed by the child and not the main pbs_mom process. TRQ-937. b - Reduce the error messages in HA mode from moms. They now only log errors if no server could be contacted. TRQ-1385. b - Fixed a seg-fault in send_depend_req. Also fixed a deadlock in the depend_on_term TRQ-1430 and TRQ-1436 b - Fixed a null pointer dereference seg-fault when checking for disallowed types TRQ-1408. b - Fix a counting problem when running multi-req ALPS jobs (cray only). TRQ-1431. b - Remove red herring error messages ’did not find work task for local request’. These tasks are no longer created since issue_Drequest blocks until it gets the reply and then processes it. TRQ-1423. b - Fixed a problem where qsub was not applying the submit filter when given in the torque.cfg file. TRQ-1446 e - When the mom has no jobs, check the aux path to make sure it is clean and that we aren’t leaving any files there. TRQ-1240. b - Fix a counting problem when running multi-req ALPS jobs (cray only). TRQ-1431. b - Remove red herring error messages ’did not find work task for local request’. These tasks are no longer created since issue_Drequest blocks until it gets the reply and then processes it. TRQ-1423. e - When the mom has no jobs, check the aux path to make sure it is clean and that we aren’t leaving any files there. TRQ-1240. b - Made it so that threads taken up by poll job tasks cannot consume all available threads in the thread pool. This will make it so other work can continue if poll jobs get stuck for whatever reason and that the server will recover. TRQ-1433 b - Fix a deadlock when recording alps reservations. TRQ-1421. b - Fixed a segfault in req_jobobit caused by NULL pointer assignment to variable pa. TRQ-1467 54