The Son of Grid Engine is a community project to continue Sun Grid Engine.
Notable changes in Son of Grid Engine
-------------------------------------
Version 8.0.0d
--------------
* Bug fixes
  * Man and other documentation fixes
  * Fix building with older gcc versions
  * Provide load average in qstat XML output [#446, #454]
  * Partially back out Univa change which broke classic spooling
  * Fix -terse in sge_request [#777]
* Other changes (possibly-incompatible)
  * Message fixes
Version 8.0.0c
--------------
* Bug fixes
  * Man and other documentation fixes
  * Build/installation fixes (particularly for Red Hat 6 and Linux 3)
  * Fix group ids for submitted jobs [U]
  * Fix default value of boolean with JSV [U]
  * Windows fixes for helper crashes and Vista GUI jobs [U]
  * Ensure parallel jobs are dispatched to the least loaded host [U]
  * Correct ownership of qsub -pty output file; was owned by admin user [U]
  * Fix format of Windows loadcheck.exe output [U]
  * Read from stderr even if stdout is already closed in IJS [U]
  * Fix PDC_INTERVAL=NEVER execd parameter [U]
  * Fix accounting information for Windows GUI jobs [U]
  * Increase default MAX_DYN_EC qmaster param [U]
  * Fix qsub -sync y error message and enforce MAX_DYN_EC correctly [U]
  * Fix job validation (-w e) behaviour [#716] [U]
  * Fix qrsh input redirection [U]
  * Avoid warning when submitting a qrsh job [U]
  * Print start time in qstat -j -xml output [U]
  * Don't raise an error changing resource request on waiting job [#806]
  * Don't exit 0 on error with qconf -secl or -sep
  * Include string.h in drmaa.h [#712]
  * Fix process-scheduler-log with host aliases
* Enhancements
  * Base qmake and qtcsh on the current gmake and tcsh source [#289,
    #504, #832]
  * Support "-binding linear" and "-binding linear:slots"
  * Use the hwloc library for all topology information and core
    binding, supporting more operating systems (now: AIX, Darwin,
    FreeBSD, GNU/Linux, HPUX, MS Windows, OSF/1, Solaris), and more
    hardware types (specifically AMD Magny Cours and similar)
  * Add task number to execd "exceeds job ... limit"
* Other changes (possibly-incompatible)
  * Modify default paths in build files and elsewhere [U]
  * Assorted message fixes
  * In RPMs, move qsched to qmaster package, and separate drmaa4ruby
  * Default to newijs in load_sge_config.sh
  * Default to sh, not csh for configured shell
Version 8.0.0b
--------------
* Bug fixes
  * Build/installation fixes [including #424, #1349] [(U)]
  * Fix execd init script [#1348]
  * Man and other documentation fixes [including #614, #764] [(U)]
  * Fix contents of admin mail properly [#1307, #1345]
  * Fix qalter messages for -tc
  * Fix build with -DSGE_PQS_API
  * Fix group ids for submitted jobs [U]
* Enhancements
  * Update qsched and add man page
* Other changes (possibly-incompatible)
  * Avoid the use of /bin/ksh [#1306]
  * Change installation defaults to classic spooling, not adding
    shadow hosts, and not JMX. [(U)]
Version 8.0.0a
--------------
This is roughly a superset of Univa's 8.0.0 (the V800_TAG from
https://github.com/gridengine/gridengine), with thanks for that.
Changes made there which haven't been included in this version: PLPA
source not removed; some different build/installation defaults
(e.g. for JMX); Univa/UGE "branding" (partly because trademark status
is unknown); authuser not removed (for SDM and testing use).
* Bug fixes
  * Many man and other documentation fixes [including #790, #776,
    #769, #733, #610, #587, #581, #459, #456, #439, #255, #1288, #797,
    #1271, #773] [(U)]
  * Some program message fixes [(U)]
  * Various build and installation fixes [including #761, #709, #656,
    #616, #546, #536, #521, #491, #438, #414, #411, #383, #381, #138,
    #455, #344, #438, #1311, #1272, #1273] [(U)]
  * Ask for keystore password twice on installation
  * Fix qmaster crashes with tightly integrated parallel jobs or
    un-discoverable qinstance [#789] [U]
  * Report 0 cores and sockets on unsupported Solaris hosts [U]
  * Fix malloc hooks which caused crashes, particularly with SuSE 11
    [#792, #748, #749] [U]
  * Verify the pe task start user in execd in non-CSP mode [U]
  * Fix binding parameters parsing [U]
  * Fix JSV logging with multiple users submitting jobs on same submit
    host [U]
  * Fix unresponsive qmaster when modifying the global configuration
    in a huge cluster [U]
  * Speed up finishing tightly integrated jobs [U]
  * Check consistency of JSV binding information properly [U]
  * Fix broken project spooling, which caused loss of project when
    restarting master when using core binding [U]
  * Fix slotwise preemption failure to unsuspend one job per host [#775] [U]
  * Fix problems retrieving passwd and group information with large
    responses [#1295] [(U)]
  * Fix JSV changing default of boolean [U]
  * Fix ENABLE_RESCHEDULE_SLAVE=1 [U]
  * Allow comma in CMDNAME with Perl JSV scripts [#803]
  * Don't put queue into error state when supplementary group id
    cannot be set [#185] [U]
  * Don't convert LF to CRLF with qrsh -pty [U]
  * Fix qconf segfault on bad subordination string [U]
  * Fix group ids of submitted jobs [U]
  * Disallow -masterq with serial jobs [#155] [U]
  * Fix 100% CPU use by shepherd of qsh [U]
  * Removed unnecessary binding warning on job starts [U]
  * Fix qconf error reports when tmp directory has 755 permissions [U]
  * Fix suspending of remote process on qrsh -pty yes <cmd> on Solaris [U]
  * Fix starting jobs after global host changed [U]
  * Reject invalid load_formula value [U]
  * Fix handling of implicitly-requested exclusive resources [U]
  * Fix execd vmem reporting on 64-bit Linux [U]
  * Fix startup of execd on Windows Vista [U]
  * Set xterm's path more appropriately on GNU/Linux [#557]
  * Fix generation of admin email from failed jobs [#1307]
  * Fix some ill-formed output from qstat -xml [#314]
  * Fix handling of multi-line environment variables propagated to
    shepherd [#395]
  * Fix example MPI PE templates
  * Fix bad quoting in JSV sh library
  * Fix checking of consumables for parallel jobs across multiple hosts [U]
* Enhancements
  * Additional and clarified documentation
  * PAM modules for ssh tight integration and access control for
    interactive jobs
  * Initial core binding support for Solaris/SPARC64 [U]
  * Some efficiency improvements and memory leaks fixed [U]
  * Ports to S/390 and PARISC GNU/Linux [U]
  * New complex m_thread [U]
  * Show topology by default in qhost [U]
  * qsub -pty switch [#704] [U]
  * Improved qmon graphics [#530] [(U)]
  * Include bash in default shell list [U]
  * A JSV that rejects all jobs [U]
  * Files for Scali-MPI
  * Ruby DRMAA implementation
  * Enable easy building against shared system libraries and use
    system openssl and bdb binaries
  * New scripts: "qsched" reports resource reservations; "status"
    wraps qstat; enable/disable submission; node-selection (idle etc.)
  * Restart argument for daemon init scripts
  * Improved efficiency of shell JSV if used with bash
  * Core dumps from crashing daemons enabled under Linux [U]
  * Example host_aliases file [#154]
  * Spec file for RPM packaging [#820]
* Other changes (possibly-incompatible)
  * Show core binding by default in qstat, qhost (use -ncb for
    compatibility) [U]
  * Removed Berkeley DB RPC support (recently dropped by BDB) [U]
  * Changed position in pending job list for user-rescheduled jobs
    (exit99, qmod -rj) and OLD_RESCHEDULE_BEHAVIOR,
    OLD_RESCHEDULE_BEHAVIOR_ARRAY_JOB parameters [U]
  * Unified GNU/Linux arch strings (lx-*, from lx24-* and lx26-*) [U]
  * Default to enabling core binding on GNU/Linux [U]
  * Removed Sun service tags support [U]
  * Removed obsolete SunHPCT5 files
Grid Engine 6.2 brings significant changes to qmaster that improve its
scalability in challenging environments, adds powerful features to the
core system, and introduces multi-cluster support for the Accounting and
Reporting Console (ARCo). It also comes with a new module that extends
Grid Engine to a new domain of use cases: the Service Domain Manager
(SDM), also known as project Hedeby, which allows computational
resources to be (re-)assigned dynamically on demand. The release also
includes many bug fixes.
patch-au compiles sge_arch.c with -ansi so that the stringification hack
works on NetBSD and FreeBSD, and probably others. Otherwise architecture
names like nbsd-i386 turn into nbsd-1. (From the FreeBSD port.)
Bugs fixed in SGE 6.1u5 since release 6.1u4
wrong documentation for upgrade 6.0u2 and higher to 6.1u2 and higher
Multiple loadsensor instances are trying to access the same temp load
file on AIX51
Validation of the Filter List in Simple Query builder fails
qhost -l h=<hostname> does not work
Numbers in error mail too large
use of the same paths for input/output stream must be dealt with
DRMAA Java language binding segfaults on Session.exit() with sol-x86
binaries on AMD64
sgeexecd startup script shouldn't suppress error messages from sge_execd
binary
Advanced Query with wild card character * does not produce correct results.
'Infinity' must be rejected when specified in 'complex_values' or RQS
limits for consumables
Invalid qconf -mrqs crashes qmaster with segmentation fault
RQS: Line wrap of host list introduces syntax error
Row Limit in ARCo Simple Query builder cannot be empty
loadsensor does not work on AIX51
qhost -xml has wrong namespace
QMON: The help for Resource Quotas is not available
qmon fills in fields incorrectly for restoring config for Submit Job
sgemaster -qmaster stop also shuts down shadowd
incorrect dependency on xinetd in init scripts for linux
Latebindings for Advanced Queries does not work
Switching from Simple Query to Advanced Query removes the Latebindings
32-bit Linux binaries are having problems with file access in 64-bit NFS
environments
using of default_domain may prohibit execd installation
Commlib might crash if running out of memory
Configuration file check of automatic installation does not recognize remsh
loadcheck prints error message "kstat finds too many cpus"
Communication library thread locking problem results in qmaster crash
ARCo should not print exception stack trace in the console
TABLESPACE values should be written to dbwriter.conf
Incorrectly considering two host group names to be the same
Clients not disconnecting correctly
SGE util/arch script is broken for AIX 5.3 Operating System
error message given by qalter -q '' <jobid> suggests a memory access problem
bootstrap(5) man page sees itself as sge_conf(5)
qmaster reinstall overwriting an existing installation fails
qconf -ae|-Ae return 0 even if exechost exists already
qconf -dxxxx does not set exit status on error
qconf -as, -am, -ao, -Ae, -Acal, -Ackpt, -Ap when msg "already exist"
should return not 0 exit code
qconf -acal doesn't return error code 1 when failed
setting of QMaster port number leads to infinite loop
use of -l tmpdir=abc can crash schedd
load scaling display not working correctly
qstat -j does not print array task information
job hold due to -hold_jid is not indicated as STATE_SYSTEM_ON_HOLD by
drmaa_job_ps(3)
Segmentation fault of sge_schedd
A load sensor reporting values for other hosts does not work
reporting file is lacking information about global consumables, if
log_consumables=false
Wallclock_Time query should be more constrained
"./install_execd -winsvc -auto /path/to/auto.conf" command causes error
The default has to be local spool directory when install_execd is run for
a Windows host
qmaster runs out of memory on AIX
dbw install parameters are not verified
Incorrect slots_total from qstat -F -xml output
Wrong permissions if install_qmaster creates qmaster spool directory
Installation of execution daemon left user unclear which port was chosen.
Exception occurs during the exportation of a query result to pdf
memory leak in sge_execd with qsub -v SGE_* or qsub -V
ARCo should support SJWC 3.1
Bugs fixed in SGE 6.1u4 since release 6.1u3
on Windows installation fails when installing as root and SGE admin
user = none
accounting records for slave tasks of pe jobs should contain the correct
task submission time
check if config parameters qlogin_daemon and rlogin_daemon are paths
parallel scheduling memory leak in sge_schedd
execd installation does not test absolute path for local spool dir
Sort on table column throws exception if explicit SORT specified in
SimpleQuery
Error.jsp contains unbalanced tag
arco_read should be able to create synonyms instead of arco_write
DBW should use batch inserts
prolog and epilog descriptions should include exit codes
It is possible to set negative tickets / shares in qmon and from the
command line
ORDER BY clause ignored in Advanced Query
Queue Consumables query incorrect in ARCo predefined queries
CLI accepts the slot number of more than 10000000
ARCo online help contains invalid, unclear or outdated information
the installation of two rpc databases on the same host fails
DBWriter should not exit if there is a database connection error
Reporting 'View' dropdown menu and 'Save Result' functionality is confusing
DBW derived rules and reporting queries that count jobs need to be updated
incomplete error logging in case of classic spooling failures
Row Limit in Simple Query uses wrong syntax
'NONE' as value is not rejected for queue_conf(5) shell and qsub(1) -S
Upgrade to 6.1u3 fails for PostgreSQL < 8.0, minor issues in
dbdefinition.xml for PSQL > 8.0
dbwriter should write checkpoint to database
dbwriter deletion rules delete tasks of pe_jobs
unclear 'exit_status' description in accounting(5) about Grid Engine
specific status
autoinstall configfile should be parsed and checked for valid input!
qstat -j output is broken for shell_path
the project field should be displayed in the qstat -j output
Wrong variable for calculating daily host values from hourly ones
Pending PE job qstat -j output displays additional useless message when not
running because of RQs
automatic backup is broken!
Spelling mistakes in the qmon help menus
deletion rule for PostgreSQL incorrect for deletion of sge_share_log
qquota broken if quota definition contains "hosts" or "users" scope negation
Access_list(5) man page not precise enough with regards secondary/primary
group(s)
RQS debitation of running jobs is broken if enabled by -mattr
Set SGE_QMASTER_PORT in settings file if sge_qmaster is not found in
/etc/services file
Failed to deliver STOP signal for subordinated jobs
Missing array job task usage in the accounting file
qhost/qstat can't be interrupted with ctrl-c
typographical errors in messages from install_qmaster
Sort order and row limit cannot be specified together in ARCo Simple Query
builder
Qmaster segfaults with long host resource evaluation expression
Error message for unsupported platforms should be more verbose
qsub does not accept resource strings size larger than 256
Memory leak in drmaa_run_job()/drmaa_run_bulk_job()
ARCo reporting module installation script is broken on Red Hat Enterprise
Linux 4 Update 4
Job predecessor list missing from qstat -j output
In SJWC on Oracle dates appear truncated to just MM/DD/YYYY
configfile check in automatic installation is too strict
load sensor might block execd port
Uninstallation of remote execd if not interactive
Infotext spawned on remote machine with -wait or -ask does not display the
text
Uninstall does not remove the SGE_STARTUP_SCRIPT
qmaster crashes when SGE_ND=1, dl 2 and BDB server spooling
inst_sge -ux all -um fails
Usage string for some commands is incomplete
dbwriter installation can't finish on large amount of data
reprioritize disappears after sge_qmaster restart
qmaster failover should not change the state of any queue
Bugs fixed in SGE 6.1u3 since release 6.1u2
-------------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
376 4743006 problem with floating point job resource limits
1909 6353628 information provided by qstat -j and qstat -j -xml are not equivalent
2076 6440408 qstat -j messages disagree between plain, XML output
2077 6440412 qstat -j -xml messages incomplete
2138 6506667 forbid deletion of global config values
2194 6527836 authuser binary returns unusable error message!
2249 6568575 SGE does not work if primary group entry is too big in groups map
2270 6575720 ENABLE_ADDGRP_KILL is missing from sge_conf(5)
2272 6575727 sge_shadowd(8) man page is missing some env vars
2274 6564461 Duplicate scheduling info messages for reservation jobs
2276 6575731 share_tree(5) doesn't explain type field
2283 6565821 Oracle, Postgres DBW should prompt for tablespace where indexes and tables should be created
2293 6569088 Resource reservation broken for sequential jobs depending on RQS specified for subset of queues only
2303 6571749 parallel resource reservation broken when non-queue instance based quotas limits apply
2323 6576153 Creating a userset with NONE as a type results in a core dump
2327 6578213 qconf -(A,D,M,R)attr dumps core when the supplied file is empty
2328 6579232 high scheduler dispatching time with many sequential resource reservation jobs and resource quotas
2336 6287501 rctemplates lack of requirement
2338 6585721 Parallel RR broken if jobs wait for queue slots and no RQS configured
2342 6590010 Original primary group vanishes after newgrp command (USE_QSUB_GID=true)
2344 6590079 Resource reservation broken with sequences of identical jobs differing only in their -R y|n
2346 6604155 qmon binary job submit is broken
2351 6597463 qsub -t 1-N:N creates a normal job with one task
2352 6594665 Installation fails on Linux with glibc 2.6
2353 6597423 commit method of UnixLoginModule does not report RuntimeExceptions
2356 6600619 Userset spooling in classic mode is broken
2367 6597547 qdel does not recognize wc_job_range_list as it is defined
2369 6577034 Several qconf options display only single message when a list of messages should be printed
2372 6469494 clients should issue a more explicit error message when qmaster is busy
2374 6589459 Expose the availability of keyword "none" in the manual page of calendar_conf
2382 6569862 Unset old_value out of the scope
2383 6553062 qconf -mc accepts erroneous resource entries without an urgency; qmon gives (poor) error message
2387 6614041 Multiple occurrence of a name in RQS limit definition break classic spooling
2392 6614108 Specifying more than one drmaa_v_env attribute causes spurious error msg
2394 6608259 scheduler prints empty line in messages file after every 'sge_mirror' logging
2396 6608236 scheduling of parallel jobs does not respect consumables, if consumable is referenced in rqs
2400 6564543 sge_shepherd should exit if it cannot write to any of its essential files
2401 6617450 add option to reporting_params for switching off writing of consumables
2404 6618328 qmon displays wrong string for queue filtering
2406 6596931 Incorrect messages in qconf command
2407 6618619 the restore feature does not delete old configuration before restoring
2409 6619016 removing parameters from the reporting_params will not fallback to the default
2410 6619657 qmod -e|-d '*' times out in large clusters
2411 6619662 qhost becomes sluggish in large clusters
2414 6618599 Long running jobs cause incorrect usage summary for ARCo database
2415 6620930 ARCO view_accounting filters out parallel job usage incorrectly
2416 6621482 ju_exit_status should provide means to recognize the intermediate record
2417 6622842 the start_time field in intermediate accounting records is incorrect
2418 6588743 qrsh fails with "connection refused" error message
2419 6391244 qstat -ext reports wrong usage as compared to other commands such as qstat -t or qstat -j
2424 6620253 During the installation the admin user should create web.xml file
2428 6630268 upgrade from 6.0u2 and higher to 6.1u2 and higher does not work
2435 6599335 inst_sge help output for -upd switch is incorrect
Bugs fixed in SGE 6.1u2 since release 6.1u1
-------------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
- 6590960 Man pages show the wrong version number
2345 6590574 resource quota can prevent dispatching of jobs that requests no resource in this quota
2343 6589807 newline missing from "illegal debug level format" message
2338 6585721 Parallel RR broken if jobs wait for queue slots and no RQS configured
2334 6584632 user/system/operator hold state combinations cause strange qstat output
Bugs fixed in SGE 6.1u1 since release 6.1
-----------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
2323 6576153 Creating a userset with NONE as a type results in a core dump
2317 6574565 Oracle, Postgres FOREIGN KEY fields need to be indexed
- 6573980 'qconf -help' suggests usage of patterns in user_list which is not true
2316 6573508 qrsh with ssh causes job to go in error state when Ctrl-C is pressed
2308 6572803 qhost -xml lacks '>' with initial qhost tag
2309 6572801 sge_queue_values definition does not contain PRIMARY KEY
2321 6571714 Inadequate error message when qconf -sstree is run when no share tree is configured
2241 6568712 util/arch has problem recognizing libc version number with comma
2292 6568578 6.1 upgrade procedure shall exit when there are jobs in the cluster
2249 6568575 SGE does not work if primary group entry is too big in groups map
2284 6565841 Oracle: rollback segments keep filling up, Postgres: delete query keeps running
2306 6564592 SGE 6.1 upgrade procedure is broken when using the classic qmaster spooling
2275 6564503 sge_schedd deadlock upon schedd_job_info job_list being enabled
2250 6558006 qmaster may crash with projects or usersets used in RQS
2243 6555744 qmon crashes when displaying about dialog
2248 6554313 add -u <user> to scheduler category only if there is a resource quota for the user
2238 6551568 need faster resource quota matchmaking and more concise job info messages
- 6550718 qstat -j lacks resource quota info messages in case of "incomplete" resource quotas
2296 6548455 csp mode installation, using /etc/services, qmaster is not starting!
2232 6546807 qhost -j -xml does not work
2325 6542987 drmaa_run_job(3) raises error if drmaa_native_specification has leading spaces
2239 6542137 use of hostgroups in resource quotas is less performant than the full list of hosts
- 6541085 NFS write error on N1GE trace file
2300 6539199 qquota(1) filtering broken for project and pe if -P/-pe switch is not used
2299 6536039 sgeremoterun not working
2201 6529974 Use of MORE fails on some architectures
- 6528949 inst_sge -ux uninstallation of exechost tries to delete local spooldir, even it isn't configured!
2191 6525883 qstat -s hX filtering is broken on darwin
2189 6525375 qacct ignores jobs in output
2320 6513115 in qmon, under calendar configuration, it is possible to modify even if no calendar exists
2326 6506661 sge_conf(5), description for rlogin_daemon and qlogin_daemon is wrong
2307 6433628 qconf -sq all.q@myhost produces no value at all for complex_values (not even NONE)
2289 6565951 Qmon panel does not check for valid data in Scheduler Configuration
2314 6513116 Qmon x qconf inconsistent in allowed characters in attribute names
- 6195248 QMON Job Control Window: Incomprehensible Priority Button
2313 6410592 Double clicking in Consumables/Fixed Attributes list does not behave as a GUI should
2312 6482211 complex attributes whose deletion is denied do not reflect back after the denial message in qmon
2301 6551121 Memory leak in libdrmaa.so
916 6355875 qsub -terse to just output job id
- 6522273 Wrong exit code with qconf -sds
2266 6563346 Wrong usage of 'day' format model in trunc(date) Oracle functions
2187 6562190 memory leak in sge_schedd
2265 6280747 qmon loses sharetree changes
747 6291044 "Modify"-Button is activated but should be grayed
2263 6553066 qmon's Complex Configuration Load and Save buttons did not work
2262 4742097 Qmon has a ticket number limitation
1729 4818801 qmon on secondary screen crashes when "Job Control" is pressed
2261 6538740 clear usage operation should implicitly trigger refresh in share-tree dialogue
2260 6327539 Ability to sort queue instances using each column of the queue instances table
2229 6544869 UNKNOWN group/owner in accounting(5)
2247 6556411 DBW queries "Average Job Turnaround Time", "Average Job Wait Time" might not work
- 6481737 Arco should support webconsole 3.0.x
- 6559385 Calling JGDI getQueueInstanceSummary results in a memory leak
1813 6328064 Queue request -q from sge_request can't be overridden through command line
- 6355674 arcorun can not be used as sge_admin user if the toc file is not available
2164 6514085 Need a possibility to update existing example queries for the ARCo web application
- 6426331 remove util/sge_log_tee from distribution
- 6476263 function job_get_id_string() is not MT safe and used in qmaster
2219 6536426 inst_sge -m fails for non-root when USER variable is not set
1860 6345522 qdel on a job in deleted state does not output any information
2258 5081743 queue status in reporting file is missing.
2050 6422335 still used usersets/project/calendar/pe/checkpoint can be removed under certain conditions
Bugs fixed in SGE 6.1 since release 6.1_beta
--------------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
1941 5086007 qstat -qs doesn't work
2183 6499217 meaningless error in clients when reporting_param flush_time is incorrectly set
- 6525497 JGDI crashes JVM when null is passed to JNI GetStringUTFChars function
2220 6440226 add installation of SGE_Helper_Service to auto installation
2221 6521802 the binary check in inst_sge is wrong!
- 6537633 Extraneous space in qsub's "Invalid month specification." message
2222 6538293 Hybrid user/project share-tree is broken for user sharing amongst array jobs
2180 6518684 Qconf usage x man page inconsistency
2181 6518689 Project man page contains different attribute names.
2171 6516288 Scheduler does not write pid file in daemonize phase
2178 6518607 invalid memory access in cl_com_get_handle
- 6520761 add background mode to N1 Grid Engine Helper Service
- 6233523 loadcheck reports on a hyperthreaded CPU only one processor
- 6276612 provide support for Itanium platform
752 6288953 scalability issue with qdel and very large array jobs
751 6291047 qconf -sstnode cannot find root
- 6303750 Install guide ambiguous on role of CSP
1930 6329378 incorrect qsub error message, if an invalid integer value is passed to the -l option
1858 6344960 qtcsh behaves differently in direct mode from qrshmode
1933 6349037 "qstat -explain E" displays explanation of the same error two times.
1940 6362523 qstat -q filter does compare hosts in queue instances
- 6363245 on some Windows execution hosts, execd hangs after the job has finished
1978 6383256 no newline at end of sge_shepherd's exit_status messages
- 6395078 wrong entry in sgepasswd file wrongly sets whole host in error state
2012 6402127 qconf -suserl reports incorrect status if no users are defined
- 6403152 qconf -as returns error code 0 even in case of unresolvable host
2015 6403810 JavaDocs for DRMAA need improvement
- 6428621 add a reserved complex value to control displaying Windows GUIs
- 6453426 Event clients will not get list updates, when they change their subscription after the registration
- 6461308 Wrong path to spooled parallel jobs with using classic spooling
2130 6501447 No online usage for MacOS X
2141 6506701 sge_shepherd dumps core on linux amd64 for qrsh jobs with very long cmdline (> 10k)
2233 6528950 modifying a RQS with invalid syntax results in its deletion
- 6533952 Admins guide does not mention that parallel environments must be linked with queues
- 6535768 Upgrade chapter 5 in 6.1 install guide must mention abolition of LD_LIBRARY_PATH for Solaris/Linux
- 6535775 Upgrade chapter in 6.1 Install Guide wrongly indicates upgrade from 5.3 were possible
- 6537476 6.1 install guide broken and incomplete with respect to MySQL installation for ARCO
- 6537607 6.1 Admins guide needs improvement on the linking between queues and parallel environments
- 6539215 quota verification time may not grow with the number of queues
2224 6539792 resource quotas broken after qmaster restart
- 6542483 Important changes with Resource Quota chapter in 6.1 admins guide
- 6545277 sge_statistic tables are not documented
2230 6546370 Pivot for ARCo Accounting Queries does not show all the fields
2231 6546802 qstat -F -xml does not show resources
Bugs fixed in SGE 6.1_beta since release 6.1_preview2
-----------------------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
- 6267190 Typo before "About the urgent priority" in Admin Guide
1445 6291021 64 bit solaris BDB rpc server broken
1703 6295319 Admin guide: refers to sge_host(5) instead of host_conf(5)
- 6344917 Error in Embedded Command Line Options example
- 6395075 on Windows, execd doesn't provide useful error messages when SSL keys broken
2188 6421113 CSP mode auto installation: certificates are not copied to submit hosts
- 6444526 Admin guide describes N1GE backup facility, but restore is not described
2196 6472614 Auto installation option failed to save the install log
2182 6513433 remote installation of execd's need enhancement, rework, cleanup
2139 6506690 dbwriter should not use autocommit mode
- 6520257 need to define continuation character behaviour with qconf file formats
- 6521285 describe useful characters for every parameter
2185 6522385 qmon crash in cluster configuration dialog when modifying a host
2192 6525917 qacct -l h=<hostname> dumps core on darwin and linux itanium
2198 6528808 sge_ca script fails on nfs no root access file systems
2202 6530335 qmaster aborts when a resource quota set is modified while jobs are running
2204 6531317 qstat -xml does not show pending/zombie jobs
2206 6531921 qstat -r -xml is not working
2207 6533754 resource quota are modified on qconf -mrqs, even if the editor is exited without saving
Bugs fixed in SGE 6.1_preview2 since release 6.1_preview1
---------------------------------------------------------
Issue Sun BugId Description
-------- --------- ------------------------------------------------------------------------------------------
- 5093930 ARCo should work with MySQL
- 5101053 Regular expressions should also be mentioned in qsub in addition to complex
- 5101735 Needs more boolean operators support for resource requests
56 6205203 Logical OR operator works only with complex attributes of type RESTRING
2135 6506115 Invalid qconf -mattr crashes qmaster
2150 6507572 qconf -Arqs added invalid RQS
2146 6510635 Default requests for complexes not honored by resource quotas
2161 6513944 qmaster core dump with usersets referenced in RQS
2162 6513967 unix groups are not considered by RQS
2166 6515122 add -wd working_dir in addition to -cwd option for submission
Bugs fixed in SGE 6.0u11 since release 6.0u10
--------------------------------------------
scalability issue with qdel and very large array jobs
loadcheck reports on a hyperthreaded CPU only one processor
CSP mode auto installation: certificates are not copied to submit hosts
qmon crashes when displaying about dialog
sge_schedd deadlock upon schedd_job_info job_list being enabled
Qmon has a ticket number limitation
qmon on secondary screen crashes when "Job Control" is pressed
queue status in reporting file is missing.
"Modify"-Button is activated but should be grayed
reporting(5) man page lacks information about sharelog records
Ability to sort queue instances using each column of the queue instances
table
Queue request -q from sge_request can't be overridden through command line
qdel on a job in deleted state does not output any information
qsub -terse to just output job id
qrsh man page doesn't explain which options don't work with interactive jobs
Numbers in error mail too large
still used usersets/project/calendar/pe/checkpoint can be removed under
certain conditions
remove util/sge_log_tee from distribution
shell_start_mode should be documented to be only used for batch jobs
reserved usage not explained in sge_conf.5
Discrepancy between load values reported by Gridengine and from the
HP-UX 64 bit env.
auto installation option failed to save the install log
function job_get_id_string() is not MT safe and used in qmaster
Broken output of job name with 'qsub -N'
meaningless error in clients when reporting_param flush_time is
incorrectly set
remote installation of execd's need enhancement, rework, cleanup
Scheduler does not write pid file in daemonize phase
invalid memory access in cl_com_get_handle
Qconf usage x man page inconsistency
Project man page contains different attribute names.
the binary check in inst_sge is wrong!
Wrong exit code with qconf -sds
qacct -l h=<hostname> dumps core on darwin and linux itanium
inst_sge -m fails for non-root when USER variable is not set
Extraneous space in qsub's "Invalid month specification." message
Hybrid user/project share-tree is broken for user sharing amongst array
jobs
clear usage operation should implicitly trigger refresh in share-tree
dialogue
UNKNOWN group/owner in accounting(5)
qmon's Complex Configuration Load and Save buttons did not work
memory leak in sge_schedd
Bugs fixed in SGE 6.0u10 since release 6.0u9
--------------------------------------------
qdel of just submitted job
can't start qmaster message appears, but qmaster is started
inst_sge script does not add the master host to the shadow_masters file
wrong qlogin_daemon or rlogin_daemon in host conf doesn't set host and
job into error state
reprioritize field in qmon cluster config missing
hostname resolving should not be case sensitive
drmaa_version() function should return 1.0
DRMAA sessions should be persistent
drmaa_control() causes illegal memory access
getDrmaaImplementation() should return the same string as getDrmSystem()
qconf rejects configuration, when attribute value ends with a space
character
during QMaster installation, creation of local database directory fails
on hp11
the execd auto_install takes too long because of long delays after a
parallel install block
Unable to modify Advanced Settings in Configuration for Host in my
cluster using qmon
qacct -l h=<hostname> dumps core
shared library name DT_SONAME not set with libdrmaa.so
Berkeley DB backup failed when using hostname with a fully qualified
domain name
the inst_common.sh is missing $SGE_EXECD_PORT
potential security issues in cull library
inst_sge -help wrongly indicates -bup/-rst works with BDB spooling only
qping segfaults in ssl mode
The sge_share_mon utility does not work with the automatic policy enforcement
the auto uninstall execd needs an ssh daemon when the uninstall is done locally
accounting records for slave tasks of pe jobs contain invalid submission time
DRMAA 1.0 interface needs complete documentation in man pages
qselect matches wrong resources which have been overridden at lower level
qsh does not work if XAUTHORITY is set in root environment
In certain cases jobs may stay in "t" state for 5 minutes
qmaster logging "scheduler tried to remove a incomplete"
ignore_fqdn is broken for the local configuration
host already exists when modifying cluster settings
segmentation fault with empty string
qmaster performance gets throttled if qsub -sync y is used when many
jobs are in the system
sge_conf(5) setting 'max_u_jobs' broken if BDB spooling is used
job control: sorting by different fields
qmon job control: display wider default columns
load formula does not recognize float as weighting factor
qmaster dies when modifying slots value for queue domain when queuename
is missing
spooledit cannot dump USERSET objects
default xterm path in arch_variables script not correct for darwin
architectures
qsh problems on darwin architecture because of wrong crypto lib
execution daemon can crash on Linux where libnss_ldap.so uses BDB 4.2
shared library
Bugs fixed in SGE 6.0u9 since release 6.0u8
-------------------------------------------
CSP mode is affected by OpenSSL Security Advisory [28th September 2006]
account string does not accept the "|" character
unreasonably long scheduler dispatch times if lots of projects are used
in share tree
unreasonably long scheduler dispatch times if lots of cluster queues are
deployed in large clusters
jobs with negative priority will be rejected by qmaster
the owner of all installed files is a non-root user. This change
affects most packages that require special users or groups by making
them use the specified unprivileged user and group instead.
(1) Add two new variables PKG_GROUPS_VARS and PKG_USERS_VARS to
unprivileged.mk. These two variables are lists of other bmake
variables that define package-specific users and groups. Packages
that have user-settable variables for users and groups, e.g. apache
and APACHE_{USER,GROUP}, courier-mta and COURIER_{USER,GROUP},
etc., should list these variables in PKG_USERS_VARS and PKG_GROUPS_VARS
so that unprivileged.mk can know to set them to ${UNPRIVILEGED_USER}
and ${UNPRIVILEGED_GROUP}.
(2) Modify packages to use PKG_GROUPS_VARS and PKG_USERS_VARS.
packages with the modular Xorg equivalent. Those are falling back
to the old location by default, so this commit doesn't change
dependencies.
graphics/xpm ==> x11/libXpm
fonts/Xft2 ==> x11/libXft
x11/Xfixes ==> x11/libXfixes
x11/xcursor ==> x11/libXcursor
x11/Xrender ==> x11/libXrender
x11/Xrandr ==> libXrandr
developer is officially maintaining the package.
The rationale for changing this from "tech-pkg" to "pkgsrc-users" is
that it implies that any user can try to maintain the package (by
submitting patches to the mailing list). Since the folks most likely
to care about the package are the folks that want to use it or are
already using it, this would leverage the energy of users who aren't
developers.
backslashes anymore. A single backslash is enough. Changed the
definition in all affected packages. For those that are not caught, an
additional check is placed into bsd.pkginstall.mk.
around at either build-time or at run-time is:
USE_TOOLS+= perl # build-time
USE_TOOLS+= perl:run # run-time
Also remove some places where perl5/buildlink3.mk was being included
by a package Makefile, but all that the package wanted was the Perl
executable.
Several changes are involved since they are all interrelated. These
changes affect about 1000 files.
The first major change is rewriting bsd.builtin.mk as well as all of
the builtin.mk files to follow the new example in bsd.builtin.mk.
The loop to include all of the builtin.mk files needed by the package
is moved from bsd.builtin.mk and into bsd.buildlink3.mk. bsd.builtin.mk
is now included by each of the individual builtin.mk files and provides
some common logic for all of the builtin.mk files. Currently, this
includes the computation for whether the native or pkgsrc version of
the package is preferred. This causes USE_BUILTIN.* to be correctly
set when one builtin.mk file includes another.
The second major change is teaching the builtin.mk files to consider
files under ${LOCALBASE} to be from pkgsrc-controlled packages. Most
of the builtin.mk files test for the presence of built-in software by
checking for the existence of certain files, e.g. <pthread.h>, and we
now assume that if that file is under ${LOCALBASE}, then it must be
from pkgsrc. This modification is a nod toward LOCALBASE=/usr. The
exceptions to this new check are the X11 distribution packages, which
are handled specially as noted below.
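The ${LOCALBASE} rule described above can be illustrated with a minimal
sketch. This is not the builtin.mk logic itself (which is written in make);
it only models the stated assumption that a detected file located under
${LOCALBASE} is treated as coming from pkgsrc. The function names and the
"pkgsrc"/"builtin" labels are hypothetical, chosen for illustration.

```python
from pathlib import PurePosixPath


def is_under_localbase(found_file: str, localbase: str) -> bool:
    """Return True if found_file lies inside the localbase directory tree."""
    file_path = PurePosixPath(found_file)
    base_path = PurePosixPath(localbase)
    # A file is "under" LOCALBASE when LOCALBASE is one of its ancestors.
    return base_path in file_path.parents


def classify_detected_file(found_file: str, localbase: str) -> str:
    """Label a detected file as 'pkgsrc' or 'builtin' (hypothetical labels)."""
    if is_under_localbase(found_file, localbase):
        return "pkgsrc"
    return "builtin"
```

With LOCALBASE=/usr/pkg, a header found at /usr/include/pthread.h would be
labelled built-in, whereas with LOCALBASE=/usr the same header would be
attributed to pkgsrc, which is the case the change is meant to accommodate.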
The third major change is providing builtin.mk and version.mk files
for each of the X11 distribution packages in pkgsrc. The builtin.mk
file can detect whether the native X11 distribution is the same as
the one provided by pkgsrc, and the version.mk file computes the
version of the X11 distribution package, whether it's built-in or not.
The fourth major change is that the buildlink3.mk files for X11 packages
that install parts which are part of X11 distribution packages, e.g.
Xpm, Xcursor, etc., now use imake to query the X11 distribution for
whether the software is already provided by the X11 distribution.
This is more accurate than grepping for a symbol name in the imake
config files. Using imake required sprinkling various builtin-imake.mk
helper files into pkgsrc directories. These files are used as input
to imake since imake can't use stdin for that purpose.
The fifth major change is in how packages note that they use X11.
Instead of setting USE_X11, package Makefiles should now include
x11.buildlink3.mk. This causes the X11 package buildlink3
and builtin logic to be executed at the correct place for buildlink3.mk
and builtin.mk files that previously set USE_X11, and fixes packages
that relied on buildlink3.mk files to implicitly note that X11 is
needed. Package buildlink3.mk should also include x11.buildlink3.mk
when linking against the package libraries requires also linking
against the X11 libraries. Where it was obvious, redundant inclusions
of x11.buildlink3.mk have been removed.
under share/examples/rc.d. The variable was already named
RCD_SCRIPTS_EXAMPLEDIR.
This is based on ideas from Greg Woods and others.
Also bumped PKGREVISION for all packages using RCD_SCRIPTS mechanism
(as requested by wiz).
in the process. (More information on tech-pkg.)
Bump PKGREVISION and BUILDLINK_DEPENDS of all packages using libtool and
installing .la files.
Bump PKGREVISION (only) of all packages depending directly on the above
via a buildlink3 include.