.. Merge pull request #2405 from minrk/paralleldoc (Min RK, r8447:48dd8a34)

.. _parallel_task:

==========================
The IPython task interface
==========================

The task interface to the cluster presents the engines as a fault tolerant,
dynamic load-balanced system of workers. Unlike the multiengine interface, in
the task interface the user has no direct access to individual engines. By
allowing the IPython scheduler to assign work, this interface is simultaneously
simpler and more powerful.

Best of all, the user can use both of these interfaces running at the same time
to take advantage of their respective strengths. When the work can be broken up
into segments that do not depend on previous execution, the task interface is
ideal. But it also has more power and flexibility, allowing the user to guide
the distribution of jobs, without having to assign tasks to engines explicitly.

Starting the IPython controller and engines
===========================================

To follow along with this tutorial, you will need to start the IPython
controller and four IPython engines. The simplest way of doing this is to use
the :command:`ipcluster` command::

    $ ipcluster start -n 4

For more detailed information about starting the controller and engines, see
our :ref:`introduction <parallel_overview>` to using IPython for parallel computing.

Creating a ``LoadBalancedView`` instance
========================================

The first step is to import the :mod:`IPython.parallel`
module and then create a :class:`.Client` instance, and we will also be using
a :class:`LoadBalancedView`, here called `lview`:

.. sourcecode:: ipython

    In [1]: from IPython.parallel import Client

    In [2]: rc = Client()

This form assumes that the controller was started on localhost with default
configuration. If not, the location of the controller must be given as an
argument to the constructor:

.. sourcecode:: ipython

    # for a visible LAN controller listening on an external port:
    In [2]: rc = Client('tcp://192.168.1.16:10101')
    # or to connect with a specific profile you have set up:
    In [3]: rc = Client(profile='mpi')

For load-balanced execution, we will make use of a :class:`LoadBalancedView` object, which can
be constructed via the client's :meth:`load_balanced_view` method:

.. sourcecode:: ipython

    In [4]: lview = rc.load_balanced_view() # default load-balanced view

.. seealso::

    For more information, see the in-depth explanation of :ref:`Views <parallel_details>`.

Quick and easy parallelism
==========================

In many cases, you simply want to apply a Python function to a sequence of
objects, but *in parallel*. Like the multiengine interface, these can be
implemented via the task interface. The exact same tools can perform these
actions in load-balanced ways as well as multiplexed ways: a parallel version
of :func:`map` and the :func:`@parallel` function decorator. If one specifies the
argument `balanced=True`, then they are dynamically load balanced. Thus, if the
execution time per item varies significantly, you should use the versions in
the task interface.

Parallel map
------------

To load-balance :meth:`map`, simply use a LoadBalancedView:

.. sourcecode:: ipython

    In [62]: lview.block = True

    In [63]: serial_result = map(lambda x:x**10, range(32))

    In [64]: parallel_result = lview.map(lambda x:x**10, range(32))

    In [65]: serial_result==parallel_result
    Out[65]: True

Parallel function decorator
---------------------------

Parallel functions are just like normal functions, but they can be called on
sequences and *in parallel*. The multiengine interface provides a decorator
that turns any Python function into a parallel function:

.. sourcecode:: ipython

    In [10]: @lview.parallel()
       ....: def f(x):
       ....:     return 10.0*x**4
       ....:

    In [11]: f.map(range(32))    # this is done in parallel
    Out[11]: [0.0,10.0,160.0,...]

.. _parallel_dependencies:

Dependencies
============

Often, pure atomic load-balancing is too primitive for your work. In these cases, you
may want to associate some kind of `Dependency` that describes when, where, or whether
a task can be run. In IPython, we provide two types of dependencies:
`Functional Dependencies`_ and `Graph Dependencies`_.

.. note::

    It is important to note that the pure ZeroMQ scheduler does not support dependencies,
    and you will see errors or warnings if you try to use dependencies with the pure
    scheduler.

Functional Dependencies
-----------------------

Functional dependencies are used to determine whether a given engine is capable of running
a particular task. This is implemented via a special :class:`Exception` class,
:class:`UnmetDependency`, found in `IPython.parallel.error`. Its use is very simple:
if a task fails with an UnmetDependency exception, then the scheduler, instead of relaying
the error up to the client like any other error, catches the error, and submits the task
to a different engine. This will repeat indefinitely, and a task will never be submitted
to a given engine a second time.

You can manually raise the :class:`UnmetDependency` yourself, but IPython has provided
some decorators for facilitating this behavior.

There are two decorators and a class used for functional dependencies:

.. sourcecode:: ipython

    In [9]: from IPython.parallel import depend, require, dependent

@require
********

The simplest sort of dependency is requiring that a Python module is available. The
``@require`` decorator lets you define a function that will only run on engines where names
you specify are importable:

.. sourcecode:: ipython

    In [10]: @require('numpy', 'zmq')
       ....: def myfunc():
       ....:     return dostuff()

Now, any time you apply :func:`myfunc`, the task will only run on a machine that has
numpy and pyzmq available, and when :func:`myfunc` is called, numpy and zmq will be imported.

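To make the mechanism concrete, here is a rough pure-Python sketch of the behavior just
described (the names mirror ``IPython.parallel``, but this is an illustrative model, not
IPython's implementation, and ``no_such_module_xyz`` is a deliberately missing name): the
wrapper tries to import each required name, and raising the exception is how an engine
signals that it cannot run the task.

```python
import importlib


class UnmetDependency(Exception):
    """Stand-in for IPython.parallel.error.UnmetDependency (sketch only)."""


def require(*names):
    """Model of @require: run the task only if all of ``names`` are importable."""
    def decorator(f):
        def wrapped(*args, **kwargs):
            for name in names:
                try:
                    importlib.import_module(name)
                except ImportError:
                    # in IPython, this tells the scheduler to try another engine
                    raise UnmetDependency(name)
            return f(*args, **kwargs)
        return wrapped
    return decorator


@require('sys')                     # always importable: the task runs
def mytask():
    return 'ran'


@require('no_such_module_xyz')      # hypothetical missing name: never runs here
def blocked():
    return 'never'
```

In the real system the exception does not reach your code; the scheduler catches it and
resubmits the task elsewhere.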
@depend
*******

The ``@depend`` decorator lets you decorate any function with any *other* function to
evaluate the dependency. The dependency function will be called at the start of the task,
and if it returns ``False``, then the dependency will be considered unmet, and the task
will be assigned to another engine. If the dependency returns *anything other than
``False``*, the rest of the task will continue.

.. sourcecode:: ipython

    In [10]: def platform_specific(plat):
       ....:     import sys
       ....:     return sys.platform == plat

    In [11]: @depend(platform_specific, 'darwin')
       ....: def mactask():
       ....:     do_mac_stuff()

    In [12]: @depend(platform_specific, 'nt')
       ....: def wintask():
       ....:     do_windows_stuff()

In this case, any time you apply ``mactask``, it will only run on an OSX machine.
``@depend`` is just like ``apply``, in that it has a ``@depend(f,*args,**kwargs)``
signature.

dependents
**********

You don't have to use the decorators on your tasks. If, for instance, you want
to run tasks with a single function but varying dependencies, you can directly construct
the :class:`dependent` object that the decorators use:

.. sourcecode:: ipython

    In [13]: def mytask(*args):
       ....:     dostuff()

    In [14]: mactask = dependent(mytask, platform_specific, 'darwin')
    # this is the same as decorating the declaration of mytask with @depend
    # but you can do it again:

    In [15]: wintask = dependent(mytask, platform_specific, 'nt')

    # in general:
    In [16]: t = dependent(f, g, *dargs, **dkwargs)

    # is equivalent to:
    In [17]: @depend(g, *dargs, **dkwargs)
       ....: def t(a,b,c):
       ....:     # contents of f

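The semantics of :class:`dependent` can be modeled in a few lines of plain Python (a
sketch of the behavior described above, not IPython's actual class): the dependency
function runs first with its own arguments, and a ``False`` return marks the task as
unmet, so the scheduler would move it to another engine.

```python
class UnmetDependency(Exception):
    """Stand-in for IPython.parallel.error.UnmetDependency (sketch only)."""


def dependent(f, g, *dargs, **dkwargs):
    """Model of dependent(f, g, *dargs, **dkwargs): check g before running f."""
    def checked(*args, **kwargs):
        # only an exact False counts as unmet; any other return continues
        if g(*dargs, **dkwargs) is False:
            raise UnmetDependency()
        return f(*args, **kwargs)
    return checked


def platform_specific(plat):
    import sys
    return sys.platform == plat


# 'not-a-real-platform' is a deliberately impossible value for illustration
impossible_task = dependent(lambda: 'did stuff', platform_specific, 'not-a-real-platform')
```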
Graph Dependencies
------------------

Sometimes you want to restrict the time and/or location to run a given task as a function
of the time and/or location of other tasks. This is implemented via a subclass of
:class:`set`, called a :class:`Dependency`. A Dependency is just a set of `msg_ids`
corresponding to tasks, and a few attributes to guide how to decide when the Dependency
has been met.

The switches we provide for interpreting whether a given dependency set has been met:

any|all
    Whether the dependency is considered met if *any* of the dependencies are done, or
    only after *all* of them have finished. This is set by a Dependency's :attr:`all`
    boolean attribute, which defaults to ``True``.

success [default: True]
    Whether to consider tasks that succeeded as fulfilling dependencies.

failure [default: False]
    Whether to consider tasks that failed as fulfilling dependencies.
    Using `failure=True,success=False` is useful for setting up cleanup tasks, to be run
    only when tasks have failed.

Sometimes you want to run a task after another, but only if that task succeeded. In this case,
``success`` should be ``True`` and ``failure`` should be ``False``. However, sometimes you may
not care whether the task succeeds, and always want the second task to run, in which case you
should use `success=failure=True`. The default behavior is to only use successes.

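Putting the three switches together, the decision they describe can be sketched as a small
pure-Python predicate (a model of the semantics above, not IPython's :class:`Dependency`
class, which also tracks which tasks have finished; ``all_`` plays the role of the
:attr:`all` attribute):

```python
def dependency_met(outcomes, all_=True, success=True, failure=False):
    """Decide whether a set of finished tasks satisfies a dependency.

    ``outcomes`` holds True (task succeeded) or False (task failed)
    for each finished dependency.
    """
    # a finished task fulfills the dependency if its outcome type is accepted
    fulfilled = [(ok and success) or (not ok and failure) for ok in outcomes]
    return all(fulfilled) if all_ else any(fulfilled)
```

With the defaults (``all_=True, success=True, failure=False``) every dependency must have
succeeded; ``success=False, failure=True`` gives the cleanup-task behavior described above.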
There are other switches for interpretation that are made at the *task* level. These are
specified via keyword arguments to the client's :meth:`apply` method.

after, follow
    You may want to run a task *after* a given set of dependencies have been run and/or
    run it *where* another set of dependencies are met. To support this, every task has an
    `after` dependency to restrict time, and a `follow` dependency to restrict
    destination.

timeout
    You may also want to set a time-limit for how long the scheduler should wait before a
    task's dependencies are met. This is done via a `timeout`, which defaults to 0, which
    indicates that the task should never time out. If the timeout is reached, and the
    scheduler still hasn't been able to assign the task to an engine, the task will fail
    with a :class:`DependencyTimeout`.

.. note::

    Dependencies only work within the task scheduler. You cannot instruct a load-balanced
    task to run after a job submitted via the MUX interface.

The simplest form of Dependencies is with `all=True,success=True,failure=False`. In these cases,
you can skip using Dependency objects, and just pass msg_ids or AsyncResult objects as the
`follow` and `after` keywords to :meth:`client.apply`:

.. sourcecode:: ipython

    In [14]: client.block=False

    In [15]: ar = lview.apply(f, args, kwargs)

    In [16]: ar2 = lview.apply(f2)

    In [17]: with lview.temp_flags(after=[ar,ar2]):
       ....:     ar3 = lview.apply(f3)

    In [18]: with lview.temp_flags(follow=[ar], timeout=2.5):
       ....:     ar4 = lview.apply(f3)

.. seealso::

    Some parallel workloads can be described as a `Directed Acyclic Graph
    <http://en.wikipedia.org/wiki/Directed_acyclic_graph>`_, or DAG. See :ref:`DAG
    Dependencies <dag_dependencies>` for an example demonstrating how to map a NetworkX DAG
    onto task dependencies.

Impossible Dependencies
***********************

The schedulers do perform some analysis on graph dependencies to determine whether they
are impossible to meet. If the scheduler does discover that a dependency cannot be
met, then the task will fail with an :class:`ImpossibleDependency` error. This way, if the
scheduler realizes that a task can never be run, it won't sit indefinitely in the
scheduler clogging the pipeline.

The basic cases that are checked:

* depending on nonexistent messages
* `follow` dependencies were run on more than one machine and `all=True`
* any dependencies failed and `all=True,success=True,failure=False`
* all dependencies failed and `all=False,success=True,failure=False`

.. warning::

    This analysis has not been proven to be rigorous, so it is possible for tasks
    to become impossible to run in obscure situations; a timeout may be a good choice.

Retries and Resubmit
====================

Retries
-------

Another flag for tasks is `retries`. This is an integer, specifying how many times
a task should be resubmitted after failure. This is useful for tasks that should still run
if their engine was shut down, or may have some statistical chance of failing. The default
is to not retry tasks.

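The retry semantics can be sketched in plain Python (a model of the behavior, not
IPython's scheduler code; in the real system each resubmission may land on a different
engine):

```python
def run_with_retries(task, retries=0):
    """Run ``task``; resubmit up to ``retries`` more times if it raises."""
    for attempt in range(retries + 1):
        try:
            return task()
        except Exception:
            if attempt == retries:
                raise  # retries exhausted: the failure reaches the client
```

The default of ``retries=0`` matches the "do not retry" default described above.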
Resubmit
--------

Sometimes you may want to re-run a task. This could be because it failed for some reason, and
you have fixed the error, or because you want to restore the cluster to an interrupted state.
For this, the :class:`Client` has a :meth:`rc.resubmit` method. This simply takes one or more
msg_ids, and returns an :class:`AsyncHubResult` for the result(s). You cannot resubmit
a task that is pending - only those that have finished, either successfully or unsuccessfully.

.. _parallel_schedulers:

Schedulers
==========

There are a variety of valid ways to determine where jobs should be assigned in a
load-balancing situation. In IPython, we support several standard schemes, and
even make it easy to define your own. The scheme can be selected via the ``scheme``
argument to :command:`ipcontroller`, or in the :attr:`TaskScheduler.schemename` attribute
of a controller config object.

The built-in routing schemes:

To select one of these schemes, simply do::

    $ ipcontroller --scheme=<schemename>

for instance::

    $ ipcontroller --scheme=lru

lru: Least Recently Used

    Always assign work to the least-recently-used engine. A close relative of
    round-robin, it will be fair with respect to the number of tasks, agnostic
    with respect to runtime of each task.

plainrandom: Plain Random

    Randomly picks an engine on which to run.

twobin: Two-Bin Random

    **Requires numpy**

    Pick two engines at random, and use the LRU of the two. This is known to be better
    than plain random in many cases, but requires a small amount of computation.

leastload: Least Load

    **This is the default scheme**

    Always assign tasks to the engine with the fewest outstanding tasks (LRU breaks ties).

weighted: Weighted Two-Bin Random

    **Requires numpy**

    Pick two engines at random using the number of outstanding tasks as inverse weights,
    and use the one with the lower load.

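As an illustration, the weighted selection might be modeled like this in plain Python (a
sketch of the described behavior only; the real scheduler uses numpy and its own
bookkeeping of outstanding tasks):

```python
import random


def weighted_pick(loads):
    """Pick two engines with probability inversely proportional to their
    number of outstanding tasks, then keep the less-loaded of the two."""
    weights = [1.0 / (n + 1) for n in loads]  # +1 so idle engines don't divide by zero
    a, b = random.choices(range(len(loads)), weights=weights, k=2)
    return a if loads[a] <= loads[b] else b
```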
386 Greedy Assignment
386 Greedy Assignment
387 -----------------
387 -----------------
388
388
389 Tasks are assigned greedily as they are submitted. If their dependencies are
389 Tasks can be assigned greedily as they are submitted. If their dependencies are
390 met, they will be assigned to an engine right away, and multiple tasks can be
390 met, they will be assigned to an engine right away, and multiple tasks can be
391 assigned to an engine at a given time. This limit is set with the
391 assigned to an engine at a given time. This limit is set with the
392 ``TaskScheduler.hwm`` (high water mark) configurable:
392 ``TaskScheduler.hwm`` (high water mark) configurable in your
393 :file:`ipcontroller_config.py` config file, with:
393
394
394 .. sourcecode:: python
395 .. sourcecode:: python
395
396
396 # the most common choices are:
397 # the most common choices are:
397 c.TaskSheduler.hwm = 0 # (minimal latency, default in IPython ≀ 0.12)
398 c.TaskSheduler.hwm = 0 # (minimal latency, default in IPython < 0.13)
398 # or
399 # or
399 c.TaskScheduler.hwm = 1 # (most-informed balancing, default in > 0.12)
400 c.TaskScheduler.hwm = 1 # (most-informed balancing, default in β‰₯ 0.13)
400

In IPython < 0.13, the default is 0, or no-limit. That is, there is no limit to the number of
tasks that can be outstanding on a given engine. This greatly benefits the
latency of execution, because network traffic can be hidden behind computation.
However, this means that workload is assigned without knowledge of how long
each task might take, and can result in poor load-balancing, particularly for
submitting a collection of heterogeneous tasks all at once. You can limit this
effect by setting hwm to a positive integer, 1 being maximum load-balancing (a
task will never be waiting if there is an idle engine), and any larger number
being a compromise between load-balancing and latency-hiding.

In practice, some users have been confused by having this optimization on by
default, so the default value has been changed to 1 in IPython 0.13. This can be slower,
but has more obvious behavior and won't result in assigning too many tasks to
some engines in heterogeneous cases.
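The trade-off can be illustrated with a small standalone sketch (plain Python, not
IPython code): pre-assigning tasks at submission time, as a round-robin stand-in for
an unlimited hwm, versus handing each task to whichever engine frees up first, as
``hwm = 1`` does. With heterogeneous task durations, the difference in total
completion time is significant:

```python
import heapq

def greedy_makespan(durations, n_engines):
    """hwm unlimited (sketch): tasks are dealt out at submission
    time, with no knowledge of how long each will take."""
    loads = [0] * n_engines
    for i, d in enumerate(durations):
        loads[i % n_engines] += d
    return max(loads)

def balanced_makespan(durations, n_engines):
    """hwm = 1 (sketch): each task goes to whichever engine
    becomes idle first."""
    finish = [0] * n_engines  # min-heap of engine finish times
    heapq.heapify(finish)
    for d in durations:
        t = heapq.heappop(finish)
        heapq.heappush(finish, t + d)
    return max(finish)

# Two long tasks mixed in with short ones: blind pre-assignment can
# stack both long tasks on one engine, while assigning on idleness
# fills in the gaps.
tasks = [10, 1, 1, 1, 10, 1, 1, 1]
print(greedy_makespan(tasks, 2))    # -> 22
print(balanced_makespan(tasks, 2))  # -> 13
```

This only models the load-balancing side; it deliberately ignores the latency-hiding
benefit of keeping a queue of work on each engine, which is why larger hwm values
remain useful for many short, homogeneous tasks.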


Pure ZMQ Scheduler
------------------

For maximum throughput, the 'pure' scheme is not Python at all, but a C-level
:class:`MonitoredQueue` from PyZMQ, which uses a ZeroMQ ``DEALER`` socket to perform all
load-balancing. This scheduler does not support any of the advanced features of the Python
:class:`.Scheduler`.

Disabled features when using the ZMQ Scheduler:

* Engine unregistration
  Task farming will be disabled if an engine unregisters.
  Further, if an engine is unregistered during computation, the scheduler may not recover.
* Dependencies
  Since there is no Python logic inside the Scheduler, routing decisions cannot be made
  based on message content.
* Early destination notification
  The Python schedulers know which engine gets which task, and notify the Hub. This
  allows graceful handling of Engines coming and going. There is no way to know
  where ZeroMQ messages have gone, so there is no way to know what tasks are on which
  engine until they *finish*. This makes recovery from engine shutdown very difficult.
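If these limitations are acceptable, the pure scheme is selected the same way as the
Python routing schemes, via the scheduler's ``scheme_name`` configurable (a sketch,
assuming the same :file:`ipcontroller_config.py` used above):

```python
# in ipcontroller_config.py
c.TaskScheduler.scheme_name = 'pure'
```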


.. note::

    TODO: performance comparisons



More details
============

The :class:`LoadBalancedView` has many more powerful features that allow quite a bit
of flexibility in how tasks are defined and run. The next places to look are
in the following classes:

* :class:`~IPython.parallel.client.view.LoadBalancedView`
* :class:`~IPython.parallel.client.asyncresult.AsyncResult`
* :meth:`~IPython.parallel.client.view.LoadBalancedView.apply`
* :mod:`~IPython.parallel.controller.dependency`

The following is an overview of how to use these classes together:

1. Create a :class:`Client` and :class:`LoadBalancedView`
2. Define some functions to be run as tasks
3. Submit your tasks using the :meth:`apply` method of your
   :class:`LoadBalancedView` instance.
4. Use :meth:`.Client.get_result` to get the results of the
   tasks, or use the :meth:`AsyncResult.get` method of the results to wait
   for and then receive the results.
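The steps above can be sketched end-to-end as follows. This is not runnable on its
own: it assumes a cluster has already been started (e.g. with ``ipcluster start``),
and ``slow_square`` is an illustrative task function, not part of the API:

```python
from IPython.parallel import Client

rc = Client()                    # step 1: connect to the running cluster
lview = rc.load_balanced_view()  # ...and get a LoadBalancedView

def slow_square(x):              # step 2: define a task function
    import time
    time.sleep(0.1)
    return x ** 2

ar = lview.apply(slow_square, 7)          # step 3: submit a single task
print(ar.get())                           # step 4: wait for and fetch the result (49)

amr = lview.map(slow_square, range(8))    # many tasks, load-balanced across engines
print(amr.get())                          # results in submission order
```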

.. seealso::

    A demo of :ref:`DAG Dependencies <dag_dependencies>` with NetworkX and IPython.