upstream/ipython Commit - r3876:4bb2eb4d

add db,resubmit/retries docs

MinRK -

r3876:4bb2eb4d

parent child

docs/source/parallel/parallel_db.txt

0 created 644 +114 0

@@ -0,0 +1,114 b''
	1	.. _parallel_db:
	2
	3	=======================
	4	IPython's Task Database
	5	=======================
	6
	7	The IPython Hub stores all task requests and results in a database. Currently supported backends
	8	are: MongoDB, SQLite (the default), and an in-memory DictDB. The most common use case for
	9	this is clients requesting results for tasks they did not submit, via:
	10
	11	.. sourcecode:: ipython
	12
	13	In [1]: rc.get_result(task_id)
	14
	15	However, since we have this DB backend, we provide a direct query method in the :class:`client`
	16	for users who want deeper introspection into their task history. The :meth:`db_query` method of
	17	the Client is modeled after MongoDB queries, so if you have used MongoDB it should look
	18	familiar. In fact, when the MongoDB backend is in use, the query is relayed directly. However,
	19	when using other backends, the interface is emulated and only a subset of queries is possible.
	20
	21	.. seealso::
	22
	23	MongoDB query docs: http://www.mongodb.org/display/DOCS/Querying
	24
	25	:meth:`Client.db_query` takes a dictionary query object, with keys from the TaskRecord key list,
	26	and values of either exact values to test, or MongoDB queries, which are dicts of The form:
	27	``{'operator' : 'argument(s)'}``. There is also an optional `keys` argument, that specifies
	28	which subset of keys should be retrieved. The default is to retrieve all keys excluding the
	29	request and result buffers. :meth:`db_query` returns a list of TaskRecord dicts. Also like
	30	MongoDB, the `msg_id` key will always be included, whether requested or not.
	31
	32	TaskRecord keys:
	33
	34	=============== =============== =============
	35	Key Type Description
	36	=============== =============== =============
	37	msg_id uuid(bytes) The msg ID
	38	header dict The request header
	39	content dict The request content (likely empty)
	40	buffers list(bytes) buffers containing serialized request objects
	41	submitted datetime timestamp for time of submission (set by client)
	42	client_uuid uuid(bytes) IDENT of client's socket
	43	engine_uuid uuid(bytes) IDENT of engine's socket
	44	started datetime time task began execution on engine
	45	completed datetime time task finished execution (success or failure) on engine
	46	resubmitted datetime time of resubmission (if applicable)
	47	result_header dict header for result
	48	result_content dict content for result
	49	result_buffers list(bytes) buffers containing serialized request objects
	50	queue bytes The name of the queue for the task ('mux' or 'task')
	51	pyin <unused> Python input (unused)
	52	pyout <unused> Python output (unused)
	53	pyerr <unused> Python traceback (unused)
	54	stdout str Stream of stdout data
	55	stderr str Stream of stderr data
	56
	57	=============== =============== =============
	58
	59	MongoDB operators we emulate on all backends:
	60
	61	========== =================
	62	Operator Python equivalent
	63	========== =================
	64	'$in' in
	65	'$nin' not in
	66	'$eq' ==
	67	'$ne' !=
	68	'$ge' >
	69	'$gte' >=
	70	'$le' <
	71	'$lte' <=
	72	========== =================
	73
	74
	75	The DB Query is useful for two primary cases:
	76
	77	1. deep polling of task status or metadata
	78	2. selecting a subset of tasks, on which to perform a later operation (e.g. wait on result, purge records, resubmit,...)
	79
	80	Example Queries
	81	===============
	82
	83
	84	To get all msg_ids that are not completed, only retrieving their ID and start time:
	85
	86	.. sourcecode:: ipython
	87
	88	In [1]: incomplete = rc.db_query({'complete' : None}, keys=['msg_id', 'started'])
	89
	90	All jobs started in the last hour by me:
	91
	92	.. sourcecode:: ipython
	93
	94	In [1]: from datetime import datetime, timedelta
	95
	96	In [2]: hourago = datetime.now() - timedelta(1./24)
	97
	98	In [3]: recent = rc.db_query({'started' : {'$gte' : hourago },
	99	'client_uuid' : rc.session.session})
	100
	101	All jobs started more than an hour ago, by clients other than me:
	102
	103	.. sourcecode:: ipython
	104
	105	In [3]: recent = rc.db_query({'started' : {'$le' : hourago },
	106	'client_uuid' : {'$ne' : rc.session.session}})
	107
	108	Result headers for all jobs on engine 3 or 4:
	109
	110	.. sourcecode:: ipython
	111
	112	In [1]: uuids = map(rc._engines.get, (3,4))
	113
	114	In [2]: hist34 = rc.db_query({'engine_uuid' : {'$in' : uuids }, keys='result_header')

IPython/parallel/client/client.py

0 +3 -1

                     query : mongodb query dict
                         The search dict. See mongodb query docs for details.
                     keys : list of strs [optional]
-                        THe subset of keys to be returned.  The default is to fetch everything.
+                        The subset of keys to be returned.  The default is to fetch everything but buffers.
                         'msg_id' will *always* be included.
                     """
+                    if isinstance(keys, basestring):
+                        keys = [keys]
                     content = dict(query=query, keys=keys)
                     self.session.send(self._query_socket, "db_request", content=content)
                     idents, msg = self.session.recv(self._query_socket, 0)

docs/source/parallel/index.txt

0 +1 0

                parallel_multiengine.txt
                parallel_task.txt
                parallel_mpi.txt
+               parallel_db.txt
                parallel_security.txt
                parallel_winhpc.txt
                parallel_demos.txt

docs/source/parallel/parallel_task.txt

0 +24 0

             Impossible Dependencies
             ***********************
                 This analysis has not been proven to be rigorous, so it is likely possible for tasks
                 to become impossible to run in obscure situations, so a timeout may be a good choice.
+            Retries and Resubmit
+            ====================
+            Retries
+            -------
+            Another flag for tasks is `retries`.  This is an integer, specifying how many times
+            a task should be resubmitted after failure.  This is useful for tasks that should still run
+            if their engine was shutdown, or may have some statistical chance of failing.  The default
+            is to not retry tasks.
+            Resubmit
+            --------
+            Sometimes you may want to re-run a task. This could be because it failed for some reason, and
+            you have fixed the error, or because you want to restore the cluster to an interrupted state.
+            For this, the :class:`Client` has a :meth:`rc.resubmit` method.  This simply takes one or more
+            msg_ids, and returns an :class:`AsyncHubResult` for the result(s).  You cannot resubmit
+            a task that is pending - only those that have finished, either successful or unsuccessful.
             .. _parallel_schedulers:
             Schedulers
                 TODO: performance comparisons
             More details
             ============

General Comments 0

Write
Preview

You need to be logged in to leave comments. Login now

No TODOs yet

	Site-wide shortcuts
/	Use quick search box
g h	Goto home page
g g	Goto my private gists page
g G	Goto my public gists page
g 0-9	Goto bookmarked items from 0-9
n r	New repository page
n g	New gist page

	Repositories
g s	Goto summary page
g c	Goto changelog page
g f	Goto files page
g F	Goto files page with file search activated
g p	Goto pull requests page
g o	Goto repository settings
g O	Goto repository access permissions settings
t s	Toggle sidebar on some pages