upstream/mercurial-mirror Commit - r21066:5ecfe76d

# bundle2.py - generic container format to transmit arbitrary data.
#

#
# This software may be used and distributed according to the terms of the
# GNU General Public License version 2 or any later version.
"""Handling of the new bundle2 format

The goal of bundle2 is to act as an atomic packet to transmit a set of
payloads in an application agnostic way. It consists of a sequence of "parts"
that will be handed to and processed by the application layer.


General format architecture
===========================

The format is structured as follows

- magic string
- stream level parameters
- payload parts (any number)
- end of stream marker.

The binary format
============================

All numbers are unsigned and big-endian.

stream level parameters
------------------------

The binary format is as follows

:params size: (16 bits integer)

  The total number of Bytes used by the parameters

:params value: arbitrary number of Bytes

  A blob of `params size` containing the serialized version of all stream level
  parameters.

  The blob contains a space separated list of parameters. Parameters with value
  are stored in the form `<name>=<value>`. Both name and value are urlquoted.

  Empty names are forbidden.

  A name MUST start with a letter. If this first letter is lower case, the
  parameter is advisory and can be safely ignored. However, when the first
  letter is capital, the parameter is mandatory and the bundling process MUST
  stop if it is not able to process it.

  Stream parameters use a simple textual format for two main reasons:

  - Stream level parameters should remain simple and we want to discourage any
    crazy usage.
  - Textual data allows easy human inspection of a bundle2 header in case of
    trouble.

  Any application level options MUST go into a bundle2 part instead.
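The stream level parameter encoding described above can be sketched as follows. This is a minimal Python 3 illustration, not the module's own code: the helper names `encode_stream_params`, `decode_stream_params` and `is_mandatory` and the sample parameter names are hypothetical, and `urllib.parse.quote`/`unquote` are assumed to stand in for the Python 2 `urllib.quote`/`unquote` calls that bundle2.py uses.

```python
import string
import struct
from urllib.parse import quote, unquote


def encode_stream_params(params):
    # params is a list of (name, value) pairs; value may be None.
    blocks = []
    for name, value in params:
        if not name:
            raise ValueError('empty parameter name')
        if name[0] not in string.ascii_letters:
            raise ValueError('non letter first character: %r' % name)
        block = quote(name)
        if value is not None:
            block = '%s=%s' % (block, quote(value))
        blocks.append(block)
    blob = ' '.join(blocks).encode('ascii')
    # 16 bits unsigned big-endian size, followed by the blob itself.
    return struct.pack('>H', len(blob)) + blob


def decode_stream_params(data):
    (size,) = struct.unpack('>H', data[:2])
    blob = data[2:2 + size].decode('ascii')
    params = {}
    if blob:
        for item in blob.split(' '):
            pieces = [unquote(piece) for piece in item.split('=', 1)]
            params[pieces[0]] = pieces[1] if len(pieces) > 1 else None
    return params


def is_mandatory(name):
    # A capital first letter marks a mandatory parameter.
    return name[0].isupper()


# Hypothetical parameter names, chosen only for illustration.
pairs = [('simple', 'none'), ('Mandatory', None), ('note', 'a b=c')]
encoded = encode_stream_params(pairs)
decoded = decode_stream_params(encoded)
```

Because names and values are urlquoted, a raw space can only ever be a separator between parameters, which is what keeps the blob both unambiguous and readable by eye.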

Payload part
------------------------

The binary format is as follows

:header size: (16 bits integer)

  The total number of Bytes used by the part headers. When the header is empty
  (size = 0) this is interpreted as the end of stream marker.

:header:

    The header defines how to interpret the part. It contains two pieces of
    data: the part type, and the part parameters.

    The part type is used to route to an application level handler that can
    interpret the payload.

    Part parameters are passed to the application level handler. They are
    meant to convey information that will help the application level object to
    interpret the part payload.

    The binary format of the header is as follows

    :typesize: (one byte)

    :parttype: alphanumerical part name

    :partid: A 32 bits integer (unique in the bundle) that can be used to refer
             to this part.

    :parameters:

        A part's parameters may have arbitrary content, the binary structure is::

            <mandatory-count><advisory-count><param-sizes><param-data>

        :mandatory-count: 1 byte, number of mandatory parameters

        :advisory-count:  1 byte, number of advisory parameters

        :param-sizes:

            N pairs of bytes, where N is the total number of parameters. Each
            pair contains (<size-of-key>, <size-of-value>) for one parameter.

        :param-data:

            A blob of bytes from which each parameter key and value can be
            retrieved using the list of size pairs stored in the previous
            field.

            Mandatory parameters come first, then the advisory ones.
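The part header layout described above can be exercised with a short round trip. The sketch below is written for Python 3 under the assumption that keys and values are byte strings; `pack_part_header` and `unpack_part_header` are hypothetical helper names that do not exist in bundle2.py, and the part type and parameters are made up for the example.

```python
import struct


def pack_part_header(parttype, partid, mandatory=(), advisory=()):
    # parttype is ASCII text; parameters are (key, value) pairs of bytes.
    ptype = parttype.encode('ascii')
    params = list(mandatory) + list(advisory)  # mandatory ones come first
    sizes = []
    for key, value in params:
        sizes.extend((len(key), len(value)))
    pieces = [
        struct.pack('>B', len(ptype)), ptype,
        struct.pack('>I', partid),
        struct.pack('>BB', len(mandatory), len(advisory)),
        struct.pack('>' + 'BB' * len(params), *sizes),
    ]
    for key, value in params:
        pieces.extend((key, value))
    header = b''.join(pieces)
    # The header itself is preceded by its 16 bits size.
    return struct.pack('>H', len(header)) + header


def unpack_part_header(data):
    (headersize,) = struct.unpack_from('>H', data, 0)
    header = data[2:2 + headersize]
    (typesize,) = struct.unpack_from('>B', header, 0)
    offset = 1
    parttype = header[offset:offset + typesize].decode('ascii')
    offset += typesize
    partid, mancount, advcount = struct.unpack_from('>IBB', header, offset)
    offset += struct.calcsize('>IBB')
    total = mancount + advcount
    sizes = struct.unpack_from('>' + 'BB' * total, header, offset)
    offset += 2 * total
    params = []
    for index in range(total):
        keysize, valuesize = sizes[2 * index], sizes[2 * index + 1]
        key = header[offset:offset + keysize]
        offset += keysize
        value = header[offset:offset + valuesize]
        offset += valuesize
        params.append((key, value))
    return parttype, partid, params[:mancount], params[mancount:]


packed = pack_part_header('MyPart', 7, [(b'key', b'value')], [(b'hint', b'')])
unpacked = unpack_part_header(packed)
```

Since every size field is a single byte, a part type, key or value longer than 255 bytes cannot be represented in this layout; `struct.pack` raises `struct.error` in that case.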

:payload:

    The payload is a series of `<chunksize><chunkdata>`.

    `chunksize` is a 32 bits integer, `chunkdata` are plain bytes (as many as
    `chunksize` says). The payload part is concluded by a zero size chunk.

    The current implementation always produces either zero or one chunk.
    This is an implementation limitation that will ultimately be lifted.
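The chunk framing described above can be illustrated with a small Python 3 sketch. The names `frame_payload` and `read_payload` are hypothetical. Unlike the implementation described in this docstring, the sketch splits the data into several chunks, which the format allows, and it assumes a well-formed stream, so short reads are not handled.

```python
import io
import struct


def frame_payload(data, chunksize=4096):
    # Emit <chunksize><chunkdata> pairs, then the zero size terminator.
    for start in range(0, len(data), chunksize):
        chunk = data[start:start + chunksize]
        yield struct.pack('>I', len(chunk))
        yield chunk
    yield struct.pack('>I', 0)


def read_payload(fp):
    # Read chunks until the zero size chunk that concludes the payload.
    chunks = []
    while True:
        (size,) = struct.unpack('>I', fp.read(4))
        if size == 0:
            break
        chunks.append(fp.read(size))
    return b''.join(chunks)


payload = b'x' * 10000
stream = b''.join(frame_payload(payload))
restored = read_payload(io.BytesIO(stream))
```

An empty payload is framed as the terminator alone, which matches the "zero chunk" case mentioned above.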

Bundle processing
============================

Each part is processed in order using a "part handler". Handlers are registered
for a certain part type.

The matching of a part to its handler is case insensitive. The case of the
part type is used to know if a part is mandatory or advisory. If the part type
contains any uppercase char it is considered mandatory. When no handler is
known for a mandatory part, the process is aborted and an exception is raised.
If the part is advisory and no handler is known, the part is ignored. When the
process is aborted, the full bundle is still read from the stream to keep the
channel usable. But none of the parts read after an abort are processed. In the
future, dropping the stream may become an option for channels we do not care to
preserve.
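The processing rules above can be condensed into a simplified Python 3 sketch. It works on plain `(parttype, payload)` tuples instead of real part objects, and `handlers`, `register`, `is_mandatory` and `dispatch` are hypothetical names used for illustration only.

```python
handlers = {}


def register(parttype):
    # Handlers are stored under the lower case form of the part type.
    def decorator(func):
        handlers[parttype.lower()] = func
        return func
    return decorator


def is_mandatory(parttype):
    # Any upper case character marks the part as mandatory.
    return parttype != parttype.lower()


def dispatch(parts):
    results = []
    iterparts = iter(parts)
    try:
        for parttype, payload in iterparts:
            handler = handlers.get(parttype.lower())
            if handler is None:
                if is_mandatory(parttype):
                    raise KeyError(parttype)
                continue  # unknown advisory part: ignored
            results.append(handler(payload))
    except Exception:
        # Drain the remaining parts so the channel stays usable,
        # but do not process any of them.
        for _ in iterparts:
            pass
        raise
    return results


@register('shout')
def shout(payload):
    return payload.upper()


processed = dispatch([('shout', 'a'), ('unknown', 'b'), ('SHOUT', 'c')])
```

In this sketch `'shout'` and `'SHOUT'` reach the same handler; the only difference is that the second spelling would abort processing if no handler were registered.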

"""

import util
import struct
import urllib
import string

import changegroup
from i18n import _

_pack = struct.pack
_unpack = struct.unpack

_magicstring = 'HG20'

_fstreamparamsize = '>H'
_fpartheadersize = '>H'
_fparttypesize = '>B'
_fpartid = '>I'
_fpayloadsize = '>I'
_fpartparamcount = '>BB'

preferedchunksize = 4096

def _makefpartparamsizes(nbparams):
    """return a struct format to read part parameter sizes

    The number of parameters is variable so we need to build that format
    dynamically.
    """
    return '>'+('BB'*nbparams)

parthandlermapping = {}

def parthandler(parttype):
    """decorator that registers a function as a bundle2 part handler

    eg::

        @parthandler('myparttype')
        def myparttypehandler(...):
            '''process a part of type "my part".'''
            ...
    """
    def _decorator(func):
        lparttype = parttype.lower() # enforce lower case matching.
        assert lparttype not in parthandlermapping
        parthandlermapping[lparttype] = func
        return func
    return _decorator

class unbundlerecords(object):
    """keep record of what happens during an unbundle

    New records are added using `records.add('cat', obj)`, where 'cat' is a
    category of record and obj is an arbitrary object.

    `records['cat']` will return all entries of this category 'cat'.

    Iterating on the object itself will yield `('category', obj)` tuples
    for all entries.

    All iterations happen in chronological order.
    """

    def __init__(self):
        self._categories = {}
        self._sequences = []
        self._replies = {}

    def add(self, category, entry, inreplyto=None):
        """add a new record of a given category.

        The entry can then be retrieved in the list returned by
        self['category']."""
        self._categories.setdefault(category, []).append(entry)
        self._sequences.append((category, entry))
        if inreplyto is not None:
            self.getreplies(inreplyto).add(category, entry)

    def getreplies(self, partid):
        """get the subrecords that reply to a specific part"""
        return self._replies.setdefault(partid, unbundlerecords())

    def __getitem__(self, cat):
        return tuple(self._categories.get(cat, ()))

    def __iter__(self):
        return iter(self._sequences)

    def __len__(self):
        return len(self._sequences)

    def __nonzero__(self):
        return bool(self._sequences)
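To show how the records object above is meant to be used, here is a condensed Python 3 restatement together with a usage example. The class name `Records`, the category names and the entries are hypothetical. `__nonzero__` is left out because Python 3 derives truthiness from `__len__`.

```python
class Records:
    # Condensed restatement of the record keeping behaviour shown above.

    def __init__(self):
        self._categories = {}
        self._sequences = []
        self._replies = {}

    def add(self, category, entry, inreplyto=None):
        self._categories.setdefault(category, []).append(entry)
        self._sequences.append((category, entry))
        if inreplyto is not None:
            # Also file the entry under the part it replies to.
            self.getreplies(inreplyto).add(category, entry)

    def getreplies(self, partid):
        return self._replies.setdefault(partid, Records())

    def __getitem__(self, cat):
        return tuple(self._categories.get(cat, ()))

    def __iter__(self):
        return iter(self._sequences)

    def __len__(self):
        return len(self._sequences)


records = Records()
records.add('changegroup', {'return': 1})
records.add('output', 'hello', inreplyto=0)
records.add('changegroup', {'return': 2})

by_category = records['changegroup']
in_order = list(records)
replies_to_part_0 = records.getreplies(0)['output']
```

Lookups by category return tuples, so callers cannot mutate the stored lists by accident, and an unknown category yields an empty tuple instead of raising.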

class bundleoperation(object):
    """an object that represents a single bundling process

    Its purpose is to carry unbundle-related objects and states.

    A new object should be created at the beginning of each bundle processing.
    The object is to be returned by the processing function.

    The object has very little content now; it will ultimately contain:
    * an access to the repo the bundle is applied to,
    * a ui object,
    * a way to retrieve a transaction to add changes to the repo,
    * a way to record the result of processing each part,
    * a way to construct a bundle response when applicable.
    """

    def __init__(self, repo, transactiongetter):
        self.repo = repo
        self.ui = repo.ui
        self.records = unbundlerecords()
        self.gettransaction = transactiongetter
        self.reply = None

class TransactionUnavailable(RuntimeError):
    pass

def _notransaction():
    """default method to get a transaction while processing a bundle

    Raise an exception to highlight the fact that no transaction was expected
    to be created"""
    raise TransactionUnavailable()

def processbundle(repo, unbundler, transactiongetter=_notransaction):
    """This function processes a bundle, applying effects to/from a repo

    It iterates over each part then searches for and uses the proper handling
    code to process the part. Parts are processed in order.

    This is a very early version of this function that will be strongly
    reworked before final usage.

    An unknown mandatory part will abort the process.
    """
    op = bundleoperation(repo, transactiongetter)
    # todo:
    # - only create reply bundle if requested.
    op.reply = bundle20(op.ui)
    # todo:
    # - replace this with an init function soon.
    # - exception catching
    unbundler.params
    iterparts = iter(unbundler)
    part = None
    try:
        for part in iterparts:
            parttype = part.type
            # part keys are matched lower case
            key = parttype.lower()
            try:
                handler = parthandlermapping[key]
                op.ui.debug('found a handler for part %r\n' % parttype)
            except KeyError:
                if key != parttype: # mandatory parts
                    # todo:
                    # - use a more precise exception
                    raise
                op.ui.debug('ignoring unknown advisory part %r\n' % key)
                # consuming the part
                part.read()
                continue

            # handler is called outside the above try block so that we don't
            # risk catching KeyErrors from anything other than the
            # parthandlermapping lookup (any KeyError raised by handler()
            # itself represents a defect of a different variety).
            handler(op, part)
            part.read()
    except Exception:
        if part is not None:
            # consume the bundle content
            part.read()
        for part in iterparts:
            # consume the bundle content
            part.read()
        raise
    return op

class bundle20(object):
    """represent an outgoing bundle2 container

    Use the `addparam` method to add stream level parameters and `addpart` to
    populate it. Then call `getchunks` to retrieve all the binary chunks of
    data that compose the bundle2 container."""

    def __init__(self, ui):
        self.ui = ui
        self._params = []
        self._parts = []

    def addparam(self, name, value=None):
        """add a stream level parameter"""
        if not name:
            raise ValueError('empty parameter name')
        if name[0] not in string.letters:
            raise ValueError('non letter first character: %r' % name)
        self._params.append((name, value))

    def addpart(self, part):
        """add a new part to the bundle2 container

        Parts contain the actual applicative payload."""
        assert part.id is None
        part.id = len(self._parts) # very cheap counter
        self._parts.append(part)

    def getchunks(self):
        self.ui.debug('start emission of %s stream\n' % _magicstring)
        yield _magicstring
        param = self._paramchunk()
        self.ui.debug('bundle parameter: %s\n' % param)
        yield _pack(_fstreamparamsize, len(param))
        if param:
            yield param

        self.ui.debug('start of parts\n')
        for part in self._parts:
            self.ui.debug('bundle part: "%s"\n' % part.type)
            for chunk in part.getchunks():
                yield chunk
        self.ui.debug('end of bundle\n')
        yield '\0\0'

    def _paramchunk(self):
        """return an encoded version of all stream parameters"""
        blocks = []
        for par, value in self._params:
            par = urllib.quote(par)
            if value is not None:
                value = urllib.quote(value)
                par = '%s=%s' % (par, value)
            blocks.append(par)
        return ' '.join(blocks)

class unpackermixin(object):
    """A mixin to extract bytes and struct data from a stream"""

    def __init__(self, fp):
        self._fp = fp

    def _unpack(self, format):
        """unpack this struct format from the stream"""
        data = self._readexact(struct.calcsize(format))
        return _unpack(format, data)

    def _readexact(self, size):
        """read exactly <size> bytes from the stream"""
        return changegroup.readexactly(self._fp, size)
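The two operations of this mixin can be tried on an in-memory stream. In the Python 3 sketch below, `readexactly` is a hypothetical stand-in for `changegroup.readexactly` (assumed here to fail when the stream ends early; the sketch raises `EOFError` for that), and `unpack_from_stream` is a hypothetical free function playing the role of `_unpack`.

```python
import io
import struct


def readexactly(fp, size):
    # Read exactly `size` bytes or fail loudly on a truncated stream.
    data = fp.read(size)
    if len(data) < size:
        raise EOFError('stream ended unexpectedly (got %d bytes, expected %d)'
                       % (len(data), size))
    return data


def unpack_from_stream(fp, fmt):
    # The struct format determines how many bytes must be consumed.
    return struct.unpack(fmt, readexactly(fp, struct.calcsize(fmt)))


# An empty bundle: magic string, zero bytes of stream parameters,
# then a zero size part header acting as the end of stream marker.
stream = io.BytesIO(b'HG20' + struct.pack('>H', 0) + struct.pack('>H', 0))
magic = readexactly(stream, 4)
(paramsize,) = unpack_from_stream(stream, '>H')
(headersize,) = unpack_from_stream(stream, '>H')
```

Deriving the read size from `struct.calcsize` keeps the format strings as the single source of truth for field widths.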


class unbundle20(unpackermixin):
    """interpret a bundle2 stream

    (this will eventually yield parts)"""

-    def __init__(self, ui, fp):
+    def __init__(self, ui, fp, header=None):
+        """If header is specified, we do not read it out of the stream."""
         self.ui = ui
         super(unbundle20, self).__init__(fp)
-        header = self._readexact(4)
-        magic, version = header[0:2], header[2:4]
-        if magic != 'HG':
-            raise util.Abort(_('not a Mercurial bundle'))
-        if version != '20':
-            raise util.Abort(_('unknown bundle version %s') % version)
+        if header is None:
+            header = self._readexact(4)
+            magic, version = header[0:2], header[2:4]
+            if magic != 'HG':
+                raise util.Abort(_('not a Mercurial bundle'))
+            if version != '20':
+                raise util.Abort(_('unknown bundle version %s') % version)
         self.ui.debug('start processing of %s stream\n' % header)

    @util.propertycache
    def params(self):
        """dictionary of stream level parameters"""
        self.ui.debug('reading bundle2 stream parameters\n')
        params = {}
        paramssize = self._unpack(_fstreamparamsize)[0]
        if paramssize:
            for p in self._readexact(paramssize).split(' '):
                p = p.split('=', 1)
                p = [urllib.unquote(i) for i in p]
                if len(p) < 2:
                    p.append(None)
                self._processparam(*p)
                params[p[0]] = p[1]
        return params

    def _processparam(self, name, value):
        """process a parameter, applying its effect if needed

        Parameters starting with a lower case letter are advisory and will be
        ignored when unknown. For those starting with an upper case letter,
        which are mandatory, this function will raise a KeyError when unknown.

        Note: no options are currently supported. Any input will either be
              ignored or cause a failure.
        """
        if not name:
            raise ValueError('empty parameter name')
        if name[0] not in string.letters:
            raise ValueError('non letter first character: %r' % name)
        # Some logic will be later added here to try to process the option for
        # a dict of known parameters.
        if name[0].islower():
            self.ui.debug("ignoring unknown parameter %r\n" % name)
        else:
            raise KeyError(name)


    def __iter__(self):
        """yield all parts contained in the stream"""
        # make sure params have been loaded
        self.params
        self.ui.debug('start extraction of bundle2 parts\n')
        headerblock = self._readpartheader()
        while headerblock is not None:
            part = unbundlepart(self.ui, headerblock, self._fp)
            yield part
            headerblock = self._readpartheader()
        self.ui.debug('end of bundle2 stream\n')

    def _readpartheader(self):
        """reads a part header size and return the bytes blob

        returns None if empty"""
        headersize = self._unpack(_fpartheadersize)[0]
        self.ui.debug('part header size: %i\n' % headersize)
        if headersize:
            return self._readexact(headersize)
        return None
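The `header` argument that this change adds to `unbundle20.__init__` appears intended for callers that have already consumed the four magic bytes, for example to decide which bundle format they are looking at. The following Python 3 sketch illustrates that idea under that assumption: `check_magic` is a hypothetical function and `ValueError` stands in for `util.Abort`. As in the change itself, the magic string is only validated when it is read here.

```python
import io


def check_magic(fp, header=None):
    # header: the 4 magic bytes, if the caller already read them.
    if header is None:
        header = fp.read(4)
        magic, version = header[0:2], header[2:4]
        if magic != b'HG':
            raise ValueError('not a Mercurial bundle')
        if version != b'20':
            raise ValueError('unknown bundle version %r' % version)
    return header


# Case 1: nothing consumed yet, the function reads the magic itself.
fresh = io.BytesIO(b'HG20rest')
read_here = check_magic(fresh)
after_fresh = fresh.read()

# Case 2: the caller sniffed the first four bytes and hands them over,
# so they are not read from the stream a second time.
sniffed_stream = io.BytesIO(b'HG20rest')
sniffed = sniffed_stream.read(4)
passed_in = check_magic(sniffed_stream, header=sniffed)
after_sniffed = sniffed_stream.read()
```

In both cases the stream ends up positioned right after the magic string, which is where the stream level parameters start.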

472

474

473

475

474

class bundlepart(object):
    """A bundle2 part contains application level payload

    The part `type` is used to route the part to the application level
    handler.
    """

    def __init__(self, parttype, mandatoryparams=(), advisoryparams=(),
                 data=''):
        self.id = None
        self.type = parttype
        self.data = data
        self.mandatoryparams = mandatoryparams
        self.advisoryparams = advisoryparams

    def getchunks(self):
        #### header
        ## parttype
        header = [_pack(_fparttypesize, len(self.type)),
                  self.type, _pack(_fpartid, self.id),
                 ]
        ## parameters
        # count
        manpar = self.mandatoryparams
        advpar = self.advisoryparams
        header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
        # size
        parsizes = []
        for key, value in manpar:
            parsizes.append(len(key))
            parsizes.append(len(value))
        for key, value in advpar:
            parsizes.append(len(key))
            parsizes.append(len(value))
        paramsizes = _pack(_makefpartparamsizes(len(parsizes) / 2), *parsizes)
        header.append(paramsizes)
        # key, value
        for key, value in manpar:
            header.append(key)
            header.append(value)
        for key, value in advpar:
            header.append(key)
            header.append(value)
        ## finalize header
        headerchunk = ''.join(header)
        yield _pack(_fpartheadersize, len(headerchunk))
        yield headerchunk
        ## payload
        for chunk in self._payloadchunks():
            yield _pack(_fpayloadsize, len(chunk))
            yield chunk
        # end of payload
        yield _pack(_fpayloadsize, 0)

    def _payloadchunks(self):
        """yield chunks of a the part payload

        Exists to handle the different methods to provide data to a part."""
        # we only support fixed size data now.
        # This will be improved in the future.
        if util.safehasattr(self.data, 'next'):
            buff = util.chunkbuffer(self.data)
            chunk = buff.read(preferedchunksize)
            while chunk:
                yield chunk
                chunk = buff.read(preferedchunksize)
        elif len(self.data):
            yield self.data
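The byte layout that `bundlepart.getchunks` emits can be illustrated with a small standalone sketch. This is not the Mercurial implementation: it is a Python 3 approximation, `encode_part` is a hypothetical name, and it returns one bytes object with at most one payload chunk instead of yielding chunks. The struct formats mirror the `_fpartheadersize`, `_fparttypesize`, `_fpartid`, `_fpartparamcount` and `_fpayloadsize` constants of the module.

```python
import struct


def encode_part(parttype, partid, mandatory=(), advisory=(), data=b""):
    """Serialize one part: header size, header, payload chunk, terminator.

    Hypothetical helper for illustration; all arguments except partid
    are bytes (or sequences of (key, value) bytes pairs).
    """
    params = list(mandatory) + list(advisory)
    header = [
        struct.pack(">B", len(parttype)),    # type size, one byte
        parttype,                            # part type
        struct.pack(">I", partid),           # 32-bit part id
        struct.pack(">BB", len(mandatory), len(advisory)),
    ]
    # One (key size, value size) byte pair per parameter.
    for key, value in params:
        header.append(struct.pack(">BB", len(key), len(value)))
    # Then the raw keys and values, mandatory parameters first.
    for key, value in params:
        header.append(key)
        header.append(value)
    headerchunk = b"".join(header)
    out = [struct.pack(">H", len(headerchunk)), headerchunk]
    if data:
        out.append(struct.pack(">I", len(data)))
        out.append(data)
    out.append(struct.pack(">I", 0))  # a zero-size chunk ends the payload
    return b"".join(out)


encoded = encode_part(b"test", 7, [(b"k", b"v")], [], b"hi")
```

Because every parameter size is packed as a single byte, keys and values longer than 255 bytes cannot be represented in this layout.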

class unbundlepart(unpackermixin):
    """a bundle part read from a bundle"""

    def __init__(self, ui, header, fp):
        super(unbundlepart, self).__init__(fp)
        self.ui = ui
        # unbundle state attr
        self._headerdata = header
        self._headeroffset = 0
        self._initialized = False
        self.consumed = False
        # part data
        self.id = None
        self.type = None
        self.mandatoryparams = None
        self.advisoryparams = None
        self._payloadstream = None
        self._readheader()

    def _fromheader(self, size):
        """return the next <size> byte from the header"""
        offset = self._headeroffset
        data = self._headerdata[offset:(offset + size)]
        self._headeroffset = offset + size
        return data

    def _unpackheader(self, format):
        """read given format from header

        This automatically compute the size of the format to read."""
        data = self._fromheader(struct.calcsize(format))
        return _unpack(format, data)

    def _readheader(self):
        """read the header and setup the object"""
        typesize = self._unpackheader(_fparttypesize)[0]
        self.type = self._fromheader(typesize)
        self.ui.debug('part type: "%s"\n' % self.type)
        self.id = self._unpackheader(_fpartid)[0]
        self.ui.debug('part id: "%s"\n' % self.id)
        ## reading parameters
        # param count
        mancount, advcount = self._unpackheader(_fpartparamcount)
        self.ui.debug('part parameters: %i\n' % (mancount + advcount))
        # param size
        fparamsizes = _makefpartparamsizes(mancount + advcount)
        paramsizes = self._unpackheader(fparamsizes)
        # make it a list of couple again
        paramsizes = zip(paramsizes[::2], paramsizes[1::2])
        # split mandatory from advisory
        mansizes = paramsizes[:mancount]
        advsizes = paramsizes[mancount:]
        # retrive param value
        manparams = []
        for key, value in mansizes:
            manparams.append((self._fromheader(key), self._fromheader(value)))
        advparams = []
        for key, value in advsizes:
            advparams.append((self._fromheader(key), self._fromheader(value)))
        self.mandatoryparams = manparams
        self.advisoryparams = advparams
        ## part payload
        def payloadchunks():
            payloadsize = self._unpack(_fpayloadsize)[0]
            self.ui.debug('payload chunk size: %i\n' % payloadsize)
            while payloadsize:
                yield self._readexact(payloadsize)
                payloadsize = self._unpack(_fpayloadsize)[0]
                self.ui.debug('payload chunk size: %i\n' % payloadsize)
        self._payloadstream = util.chunkbuffer(payloadchunks())
        # we read the data, tell it
        self._initialized = True

    def read(self, size=None):
        """read payload data"""
        if not self._initialized:
            self._readheader()
        if size is None:
            data = self._payloadstream.read()
        else:
            data = self._payloadstream.read(size)
        if size is None or len(data) < size:
            self.consumed = True
        return data

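The reading side in `unbundlepart._readheader` walks the header blob with a moving offset, then reads payload chunks from the stream until a zero-size chunk. A rough standalone Python 3 sketch of both steps follows. `parse_part_header` and `read_payload` are hypothetical names, and unlike the real code the sketch does no debug logging and does not wrap the chunks in `util.chunkbuffer`.

```python
import io
import struct


def parse_part_header(header):
    """Split a part header blob into (type, id, mandatory, advisory)."""
    offset = 0

    def take(size):
        # Return the next `size` bytes of the header and advance the offset.
        nonlocal offset
        chunk = header[offset:offset + size]
        offset += size
        return chunk

    (typesize,) = struct.unpack(">B", take(1))
    parttype = take(typesize)
    (partid,) = struct.unpack(">I", take(4))
    mancount, advcount = struct.unpack(">BB", take(2))
    total = mancount + advcount
    # One (key size, value size) byte pair per parameter.
    sizes = struct.unpack(">" + "BB" * total, take(2 * total))
    pairs = list(zip(sizes[::2], sizes[1::2]))
    params = [(take(ksize), take(vsize)) for ksize, vsize in pairs]
    # Mandatory parameters come first, advisory ones after.
    return parttype, partid, params[:mancount], params[mancount:]


def read_payload(fp):
    """Yield payload chunks until the zero-size terminator chunk."""
    (size,) = struct.unpack(">I", fp.read(4))
    while size:
        yield fp.read(size)
        (size,) = struct.unpack(">I", fp.read(4))


stream = io.BytesIO(b"\x00\x00\x00\x02hi\x00\x00\x00\x00")
chunks = list(read_payload(stream))  # one chunk, then the terminator
```

The sketch wraps `zip` in `list` because Python 3 returns an iterator there; the Python 2 code in the changeset can slice the result of `zip` directly.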

@parthandler('changegroup')
def handlechangegroup(op, inpart):
    """apply a changegroup part on the repo

    This is a very early implementation that will massive rework before being
    inflicted to any end-user.
    """
    # Make sure we trigger a transaction creation
    #
    # The addchangegroup function will get a transaction object by itself, but
    # we need to make sure we trigger the creation of a transaction object used
    # for the whole processing scope.
    op.gettransaction()
    cg = changegroup.unbundle10(inpart, 'UN')
    ret = changegroup.addchangegroup(op.repo, cg, 'bundle2', 'bundle2')
    op.records.add('changegroup', {'return': ret})
    if op.reply is not None:
        # This is definitly not the final form of this
        # return. But one need to start somewhere.
        part = bundlepart('reply:changegroup', (),
                          [('in-reply-to', str(inpart.id)),
                           ('return', '%i' % ret)])
        op.reply.addpart(part)
    assert not inpart.read()

@parthandler('reply:changegroup')
def handlechangegroup(op, inpart):
    p = dict(inpart.advisoryparams)
    ret = int(p['return'])
    op.records.add('changegroup', {'return': ret}, int(p['in-reply-to']))

@parthandler('check:heads')
def handlechangegroup(op, inpart):
    """check that head of the repo did not change

    This is used to detect a push race when using unbundle.
    This replaces the "heads" argument of unbundle."""
    h = inpart.read(20)
    heads = []
    while len(h) == 20:
        heads.append(h)
        h = inpart.read(20)
    assert not h
    if heads != op.repo.heads():
        raise exchange.PushRaced()
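
The `check:heads` handler treats the payload as a concatenation of 20-byte binary node ids and compares the resulting list with the current repository heads. A minimal standalone Python 3 sketch of that logic is given below. `read_heads`, `check_heads` and the local `PushRaced` class are hypothetical stand-ins; the real handler reads from an `unbundlepart` and raises `exchange.PushRaced`.

```python
import io

NODE_SIZE = 20  # length of a binary SHA-1 node id


class PushRaced(Exception):
    """Hypothetical stand-in for exchange.PushRaced."""


def read_heads(part):
    """Read 20-byte node ids from a file-like object until it is exhausted."""
    heads = []
    node = part.read(NODE_SIZE)
    while len(node) == NODE_SIZE:
        heads.append(node)
        node = part.read(NODE_SIZE)
    if node:
        # A short trailing read means the payload was not a multiple of 20.
        raise ValueError("trailing bytes in heads payload")
    return heads


def check_heads(part, current_heads):
    """Raise PushRaced if the heads in the payload differ from current_heads."""
    if read_heads(part) != list(current_heads):
        raise PushRaced()


a, b = b"a" * NODE_SIZE, b"b" * NODE_SIZE
heads = read_heads(io.BytesIO(a + b))
```

As in the handler above, the comparison is order sensitive: the same set of heads sent in a different order is reported as a race.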


             # bundle2.py - generic container format to transmit arbitrary data.
             #
             # Copyright 2013 Facebook, Inc.
             #
             # This software may be used and distributed according to the terms of the
             # GNU General Public License version 2 or any later version.
             """Handling of the new bundle2 format
             The goal of bundle2 is to act as an atomically packet to transmit a set of
             payloads in an application agnostic way. It consist in a sequence of "parts"
             that will be handed to and processed by the application layer.
             General format architecture
             ===========================
             The format is architectured as follow
              - magic string
              - stream level parameters
              - payload parts (any number)
              - end of stream marker.
             the Binary format
             ============================
             All numbers are unsigned and big-endian.
             stream level parameters
             ------------------------
             Binary format is as follow
             :params size: (16 bits integer)
               The total number of Bytes used by the parameters
             :params value: arbitrary number of Bytes
               A blob of `params size` containing the serialized version of all stream level
               parameters.
               The blob contains a space separated list of parameters. Parameters with value
               are stored in the form `<name>=<value>`. Both name and value are urlquoted.
               Empty name are obviously forbidden.
               Name MUST start with a letter. If this first letter is lower case, the
               parameter is advisory and can be safely ignored. However when the first
               letter is capital, the parameter is mandatory and the bundling process MUST
               stop if he is not able to proceed it.
               Stream parameters use a simple textual format for two main reasons:
               - Stream level parameters should remain simple and we want to discourage any
                 crazy usage.
               - Textual data allow easy human inspection of a bundle2 header in case of
                 troubles.
               Any Applicative level options MUST go into a bundle2 part instead.
             Payload part
             ------------------------
             Binary format is as follow
             :header size: (16 bits inter)
               The total number of Bytes used by the part headers. When the header is empty
               (size = 0) this is interpreted as the end of stream marker.
             :header:
                 The header defines how to interpret the part. It contains two piece of
                 data: the part type, and the part parameters.
                 The part type is used to route an application level handler, that can
                 interpret payload.
                 Part parameters are passed to the application level handler.  They are
                 meant to convey information that will help the application level object to
                 interpret the part payload.
                 The binary format of the header is has follow
                 :typesize: (one byte)
                 :parttype: alphanumerical part name
                 :partid: A 32bits integer (unique in the bundle) that can be used to refer
                          to this part.
                 :parameters:
                     Part's parameter may have arbitrary content, the binary structure is::
                         <mandatory-count><advisory-count><param-sizes><param-data>
                     :mandatory-count: 1 byte, number of mandatory parameters
                     :advisory-count:  1 byte, number of advisory parameters
                     :param-sizes:
                         N couple of bytes, where N is the total number of parameters. Each
                         couple contains (<size-of-key>, <size-of-value) for one parameter.
                     :param-data:
                         A blob of bytes from which each parameter key and value can be
                         retrieved using the list of size couples stored in the previous
                         field.
                         Mandatory parameters comes first, then the advisory ones.
             :payload:
                 payload is a series of `<chunksize><chunkdata>`.
                 `chunksize` is a 32 bits integer, `chunkdata` are plain bytes (as much as
                 `chunksize` says)` The payload part is concluded by a zero size chunk.
                 The current implementation always produces either zero or one chunk.
                 This is an implementation limitation that will ultimately be lifted.
             Bundle processing
             ============================
             Each part is processed in order using a "part handler". Handler are registered
             for a certain part type.
             The matching of a part to its handler is case insensitive. The case of the
             part type is used to know if a part is mandatory or advisory. If the Part type
             contains any uppercase char it is considered mandatory. When no handler is
             known for a Mandatory part, the process is aborted and an exception is raised.
             If the part is advisory and no handler is known, the part is ignored. When the
             process is aborted, the full bundle is still read from the stream to keep the
             channel usable. But none of the part read from an abort are processed. In the
             future, dropping the stream may become an option for channel we do not care to
             preserve.
             """
             import util
             import struct
             import urllib
             import string
             import changegroup
             from i18n import _
             _pack = struct.pack
             _unpack = struct.unpack
             _magicstring = 'HG20'
             _fstreamparamsize = '>H'
             _fpartheadersize = '>H'
             _fparttypesize = '>B'
             _fpartid = '>I'
             _fpayloadsize = '>I'
             _fpartparamcount = '>BB'
             preferedchunksize = 4096
             def _makefpartparamsizes(nbparams):
                 """return a struct format to read part parameter sizes
                 The number parameters is variable so we need to build that format
                 dynamically.
                 """
                 return '>'+('BB'*nbparams)
             parthandlermapping = {}
             def parthandler(parttype):
                 """decorator that register a function as a bundle2 part handler
                 eg::
                     @parthandler('myparttype')
                     def myparttypehandler(...):
                         '''process a part of type "my part".'''
                         ...
                 """
                 def _decorator(func):
                     lparttype = parttype.lower() # enforce lower case matching.
                     assert lparttype not in parthandlermapping
                     parthandlermapping[lparttype] = func
                     return func
                 return _decorator
             class unbundlerecords(object):
                 """keep record of what happens during and unbundle
                 New records are added using `records.add('cat', obj)`. Where 'cat' is a
                 category of record and obj is an arbitrary object.
                 `records['cat']` will return all entries of this category 'cat'.
                 Iterating on the object itself will yield `('category', obj)` tuples
                 for all entries.
                 All iterations happens in chronological order.
                 """
                 def __init__(self):
                     self._categories = {}
                     self._sequences = []
                     self._replies = {}
                 def add(self, category, entry, inreplyto=None):
                     """add a new record of a given category.
                     The entry can then be retrieved in the list returned by
                     self['category']."""
                     self._categories.setdefault(category, []).append(entry)
                     self._sequences.append((category, entry))
                     if inreplyto is not None:
                         self.getreplies(inreplyto).add(category, entry)
                 def getreplies(self, partid):
                     """get the subrecords that replies to a specific part"""
                     return self._replies.setdefault(partid, unbundlerecords())
                 def __getitem__(self, cat):
                     return tuple(self._categories.get(cat, ()))
                 def __iter__(self):
                     return iter(self._sequences)
                 def __len__(self):
                     return len(self._sequences)
                 def __nonzero__(self):
                     return bool(self._sequences)
             class bundleoperation(object):
                 """an object that represents a single bundling process
                 Its purpose is to carry unbundle-related objects and states.
                 A new object should be created at the beginning of each bundle processing.
                 The object is to be returned by the processing function.
                 The object has very little content now it will ultimately contain:
                 * an access to the repo the bundle is applied to,
                 * a ui object,
                 * a way to retrieve a transaction to add changes to the repo,
                 * a way to record the result of processing each part,
                 * a way to construct a bundle response when applicable.
                 """
                 def __init__(self, repo, transactiongetter):
                     self.repo = repo
                     self.ui = repo.ui
                     self.records = unbundlerecords()
                     self.gettransaction = transactiongetter
                     self.reply = None
             class TransactionUnavailable(RuntimeError):
                 pass
             def _notransaction():
                 """default method to get a transaction while processing a bundle
                 Raise an exception to highlight the fact that no transaction was expected
                 to be created"""
                 raise TransactionUnavailable()
             def processbundle(repo, unbundler, transactiongetter=_notransaction):
                 """This function process a bundle, apply effect to/from a repo
                 It iterates over each part then searches for and uses the proper handling
                 code to process the part. Parts are processed in order.
                 This is very early version of this function that will be strongly reworked
                 before final usage.
                 Unknown Mandatory part will abort the process.
                 """
                 op = bundleoperation(repo, transactiongetter)
                 # todo:
                 # - only create reply bundle if requested.
                 op.reply = bundle20(op.ui)
                 # todo:
                 # - replace this is a init function soon.
                 # - exception catching
                 unbundler.params
                 iterparts = iter(unbundler)
                 part = None
                 try:
                     for part in iterparts:
                         parttype = part.type
                         # part key are matched lower case
                         key = parttype.lower()
                         try:
                             handler = parthandlermapping[key]
                             op.ui.debug('found a handler for part %r\n' % parttype)
                         except KeyError:
                             if key != parttype: # mandatory parts
                                 # todo:
                                 # - use a more precise exception
                                 raise
                             op.ui.debug('ignoring unknown advisory part %r\n' % key)
                             # consuming the part
                             part.read()
                             continue
                         # handler is called outside the above try block so that we don't
                         # risk catching KeyErrors from anything other than the
                         # parthandlermapping lookup (any KeyError raised by handler()
                         # itself represents a defect of a different variety).
                         handler(op, part)
                         part.read()
                 except Exception:
                     if part is not None:
                         # consume the bundle content
                         part.read()
                     for part in iterparts:
                         # consume the bundle content
                         part.read()
                     raise
                 return op
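
The dispatch rule used by `processbundle` (lower-case lookup, with any uppercase character in the part type marking the part as mandatory) can be isolated in a short Python 3 sketch. The `handlers` dict, `dispatch` function and `(parttype, payload)` tuples are simplifications invented for illustration. The real function also consumes the remaining parts of the stream before re-raising, which is omitted here.

```python
handlers = {}


def parthandler(parttype):
    """Register a function as handler for a part type (lower-case key)."""
    def decorator(func):
        handlers[parttype.lower()] = func
        return func
    return decorator


def dispatch(parts):
    """Process (parttype, payload) pairs in order and collect the results."""
    results = []
    for parttype, payload in parts:
        key = parttype.lower()
        handler = handlers.get(key)
        if handler is None:
            if key != parttype:
                # At least one uppercase character: the part is mandatory.
                raise KeyError(parttype)
            continue  # unknown advisory part, silently ignored
        results.append(handler(payload))
    return results


@parthandler("echo")
def handle_echo(payload):
    return payload


results = dispatch([("echo", b"x"), ("unknown", b"y"), ("ECHO", b"z")])
```

The lookup happens before the handler is called, so a `KeyError` raised inside a handler is not mistaken for a missing handler. This matches the intent stated in the comment of the function above.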
             class bundle20(object):
                 """represent an outgoing bundle2 container
                 Use the `addparam` method to add stream level parameter. and `addpart` to
                 populate it. Then call `getchunks` to retrieve all the binary chunks of
                 data that compose the bundle2 container."""
                 def __init__(self, ui):
                     self.ui = ui
                     self._params = []
                     self._parts = []
                 def addparam(self, name, value=None):
                     """add a stream level parameter"""
                     if not name:
                         raise ValueError('empty parameter name')
                     if name[0] not in string.letters:
                         raise ValueError('non letter first character: %r' % name)
                     self._params.append((name, value))
                 def addpart(self, part):
                     """add a new part to the bundle2 container
                     Parts contains the actual applicative payload."""
                     assert part.id is None
                     part.id = len(self._parts) # very cheap counter
                     self._parts.append(part)
                 def getchunks(self):
                     self.ui.debug('start emission of %s stream\n' % _magicstring)
                     yield _magicstring
                     param = self._paramchunk()
                     self.ui.debug('bundle parameter: %s\n' % param)
                     yield _pack(_fstreamparamsize, len(param))
                     if param:
                         yield param
                     self.ui.debug('start of parts\n')
                     for part in self._parts:
                         self.ui.debug('bundle part: "%s"\n' % part.type)
                         for chunk in part.getchunks():
                             yield chunk
                     self.ui.debug('end of bundle\n')
                     yield '\0\0'
                 def _paramchunk(self):
                     """return a encoded version of all stream parameters"""
                     blocks = []
                     for par, value in self._params:
                         par = urllib.quote(par)
                         if value is not None:
                             value = urllib.quote(value)
                             par = '%s=%s' % (par, value)
                         blocks.append(par)
                     return ' '.join(blocks)
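
The textual encoding of stream level parameters is easy to try out in isolation. The following Python 3 sketch uses `urllib.parse.quote` and `unquote` in place of the Python 2 `urllib.quote` and `urllib.unquote` calls seen in this file. `encode_params` and `decode_params` are hypothetical names, and the mandatory/advisory check done by `_processparam` is left out.

```python
from urllib.parse import quote, unquote


def encode_params(params):
    """Encode (name, value) pairs as a space separated, urlquoted blob."""
    blocks = []
    for name, value in params:
        block = quote(name)
        if value is not None:
            block = "%s=%s" % (block, quote(value))
        blocks.append(block)
    return " ".join(blocks)


def decode_params(blob):
    """Decode a parameter blob into a dict; valueless names map to None."""
    params = {}
    if blob:
        for item in blob.split(" "):
            pieces = [unquote(p) for p in item.split("=", 1)]
            if len(pieces) < 2:
                pieces.append(None)
            params[pieces[0]] = pieces[1]
    return params


blob = encode_params([("caution", None), ("elephants", "a b=c")])
decoded = decode_params(blob)
```

Quoting both names and values keeps the space and `=` separators unambiguous, since any literal space or `=` inside a value is percent-encoded.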
             class unpackermixin(object):
                 """A mixin to extract bytes and struct data from a stream"""
                 def __init__(self, fp):
                     self._fp = fp
                 def _unpack(self, format):
                     """unpack this struct format from the stream"""
                     data = self._readexact(struct.calcsize(format))
                     return _unpack(format, data)
                 def _readexact(self, size):
                     """read exactly <size> bytes from the stream"""
                     return changegroup.readexactly(self._fp, size)
             class unbundle20(unpackermixin):
                 """interpret a bundle2 stream
                 (this will eventually yield parts)"""
-                def __init__(self, ui, fp):
+                def __init__(self, ui, fp, header=None):
+                    """If header is specified, we do not read it out of the stream."""
                     self.ui = ui
                     super(unbundle20, self).__init__(fp)
-                    header = self._readexact(4)
-                    magic, version = header[0:2], header[2:4]
-                    if magic != 'HG':
-                        raise util.Abort(_('not a Mercurial bundle'))
-                    if version != '20':
-                        raise util.Abort(_('unknown bundle version %s') % version)
+                    if header is None:
+                        header = self._readexact(4)
+                        magic, version = header[0:2], header[2:4]
+                        if magic != 'HG':
+                            raise util.Abort(_('not a Mercurial bundle'))
+                        if version != '20':
+                            raise util.Abort(_('unknown bundle version %s') % version)
                     self.ui.debug('start processing of %s stream\n' % header)
                 @util.propertycache
                 def params(self):
                     """dictionary of stream level parameters"""
                     self.ui.debug('reading bundle2 stream parameters\n')
                     params = {}
                     paramssize = self._unpack(_fstreamparamsize)[0]
                     if paramssize:
                         for p in self._readexact(paramssize).split(' '):
                             p = p.split('=', 1)
                             p = [urllib.unquote(i) for i in p]
                             if len(p) < 2:
                                 p.append(None)
                             self._processparam(*p)
                             params[p[0]] = p[1]
                     return params
                 def _processparam(self, name, value):
                     """process a parameter, applying its effect if needed
                     Parameters starting with a lower case letter are advisory and will be
                     ignored when unknown.  Those starting with an upper case letter are
                     mandatory and this function will raise a KeyError when they are unknown.
                     Note: no options are currently supported. Any input will be either
                           ignored or rejected.
                     """
                     if not name:
                         raise ValueError('empty parameter name')
                     if name[0] not in string.letters:
                         raise ValueError('non letter first character: %r' % name)
                     # Some logic will later be added here to try to process the option
                     # from a dict of known parameters.
                     if name[0].islower():
                         self.ui.debug("ignoring unknown parameter %r\n" % name)
                     else:
                         raise KeyError(name)
                 def __iter__(self):
                     """yield all parts contained in the stream"""
                     # make sure params have been loaded
                     self.params
                     self.ui.debug('start extraction of bundle2 parts\n')
                     headerblock = self._readpartheader()
                     while headerblock is not None:
                         part = unbundlepart(self.ui, headerblock, self._fp)
                         yield part
                         headerblock = self._readpartheader()
                     self.ui.debug('end of bundle2 stream\n')
                 def _readpartheader(self):
                     """read a part header size and return the header as a bytes blob
                     returns None if empty"""
                     headersize = self._unpack(_fpartheadersize)[0]
                     self.ui.debug('part header size: %i\n' % headersize)
                     if headersize:
                         return self._readexact(headersize)
                     return None
             class bundlepart(object):
                 """A bundle2 part contains application level payload
                 The part `type` is used to route the part to the application level
                 handler.
                 """
                 def __init__(self, parttype, mandatoryparams=(), advisoryparams=(),
                              data=''):
                     self.id = None
                     self.type = parttype
                     self.data = data
                     self.mandatoryparams = mandatoryparams
                     self.advisoryparams = advisoryparams
                 def getchunks(self):
                     #### header
                     ## parttype
                     header = [_pack(_fparttypesize, len(self.type)),
                               self.type, _pack(_fpartid, self.id),
                              ]
                     ## parameters
                     # count
                     manpar = self.mandatoryparams
                     advpar = self.advisoryparams
                     header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
                     # size
                     parsizes = []
                     for key, value in manpar:
                         parsizes.append(len(key))
                         parsizes.append(len(value))
                     for key, value in advpar:
                         parsizes.append(len(key))
                         parsizes.append(len(value))
                     paramsizes = _pack(_makefpartparamsizes(len(parsizes) / 2), *parsizes)
                     header.append(paramsizes)
                     # key, value
                     for key, value in manpar:
                         header.append(key)
                         header.append(value)
                     for key, value in advpar:
                         header.append(key)
                         header.append(value)
                     ## finalize header
                     headerchunk = ''.join(header)
                     yield _pack(_fpartheadersize, len(headerchunk))
                     yield headerchunk
                     ## payload
                     for chunk in self._payloadchunks():
                         yield _pack(_fpayloadsize, len(chunk))
                         yield chunk
                     # end of payload
                     yield _pack(_fpayloadsize, 0)
                 def _payloadchunks(self):
                     """yield chunks of the part payload
                     Exists to handle the different methods to provide data to a part."""
                     # we only support fixed size data now.
                     # This will be improved in the future.
                     if util.safehasattr(self.data, 'next'):
                         buff = util.chunkbuffer(self.data)
                         chunk = buff.read(preferedchunksize)
                         while chunk:
                             yield chunk
                             chunk = buff.read(preferedchunksize)
                     elif len(self.data):
                         yield self.data
             class unbundlepart(unpackermixin):
                 """a bundle part read from a bundle"""
                 def __init__(self, ui, header, fp):
                     super(unbundlepart, self).__init__(fp)
                     self.ui = ui
                     # unbundle state attr
                     self._headerdata = header
                     self._headeroffset = 0
                     self._initialized = False
                     self.consumed = False
                     # part data
                     self.id = None
                     self.type = None
                     self.mandatoryparams = None
                     self.advisoryparams = None
                     self._payloadstream = None
                     self._readheader()
                 def _fromheader(self, size):
                     """return the next <size> bytes from the header"""
                     offset = self._headeroffset
                     data = self._headerdata[offset:(offset + size)]
                     self._headeroffset = offset + size
                     return data
                 def _unpackheader(self, format):
                     """read given format from header
                     This automatically computes the size of the format to read."""
                     data = self._fromheader(struct.calcsize(format))
                     return _unpack(format, data)
                 def _readheader(self):
                     """read the header and setup the object"""
                     typesize = self._unpackheader(_fparttypesize)[0]
                     self.type = self._fromheader(typesize)
                     self.ui.debug('part type: "%s"\n' % self.type)
                     self.id = self._unpackheader(_fpartid)[0]
                     self.ui.debug('part id: "%s"\n' % self.id)
                     ## reading parameters
                     # param count
                     mancount, advcount = self._unpackheader(_fpartparamcount)
                     self.ui.debug('part parameters: %i\n' % (mancount + advcount))
                     # param size
                     fparamsizes = _makefpartparamsizes(mancount + advcount)
                     paramsizes = self._unpackheader(fparamsizes)
                     # make it a list of (key size, value size) pairs again
                     paramsizes = zip(paramsizes[::2], paramsizes[1::2])
                     # split mandatory from advisory
                     mansizes = paramsizes[:mancount]
                     advsizes = paramsizes[mancount:]
                     # retrieve param values
                     manparams = []
                     for key, value in mansizes:
                         manparams.append((self._fromheader(key), self._fromheader(value)))
                     advparams = []
                     for key, value in advsizes:
                         advparams.append((self._fromheader(key), self._fromheader(value)))
                     self.mandatoryparams = manparams
                     self.advisoryparams = advparams
                     ## part payload
                     def payloadchunks():
                         payloadsize = self._unpack(_fpayloadsize)[0]
                         self.ui.debug('payload chunk size: %i\n' % payloadsize)
                         while payloadsize:
                             yield self._readexact(payloadsize)
                             payloadsize = self._unpack(_fpayloadsize)[0]
                             self.ui.debug('payload chunk size: %i\n' % payloadsize)
                     self._payloadstream = util.chunkbuffer(payloadchunks())
                     # the header has been read, record it
                     self._initialized = True
                 def read(self, size=None):
                     """read payload data"""
                     if not self._initialized:
                         self._readheader()
                     if size is None:
                         data = self._payloadstream.read()
                     else:
                         data = self._payloadstream.read(size)
                     if size is None or len(data) < size:
                         self.consumed = True
                     return data
             @parthandler('changegroup')
             def handlechangegroup(op, inpart):
                 """apply a changegroup part on the repo
                 This is a very early implementation that will undergo massive rework
                 before being inflicted on any end-user.
                 """
                 # Make sure we trigger a transaction creation
                 #
                 # The addchangegroup function will get a transaction object by itself, but
                 # we need to make sure we trigger the creation of a transaction object used
                 # for the whole processing scope.
                 op.gettransaction()
                 cg = changegroup.unbundle10(inpart, 'UN')
                 ret = changegroup.addchangegroup(op.repo, cg, 'bundle2', 'bundle2')
                 op.records.add('changegroup', {'return': ret})
                 if op.reply is not None:
                     # This is definitely not the final form of this
                     # return. But one needs to start somewhere.
                     part = bundlepart('reply:changegroup', (),
                                        [('in-reply-to', str(inpart.id)),
                                         ('return', '%i' % ret)])
                     op.reply.addpart(part)
                 assert not inpart.read()
             @parthandler('reply:changegroup')
             def handlechangegroup(op, inpart):
                 p = dict(inpart.advisoryparams)
                 ret = int(p['return'])
                 op.records.add('changegroup', {'return': ret}, int(p['in-reply-to']))
             @parthandler('check:heads')
             def handlechangegroup(op, inpart):
                 """check that the heads of the repo did not change
                 This is used to detect a push race when using unbundle.
                 This replaces the "heads" argument of unbundle."""
                 h = inpart.read(20)
                 heads = []
                 while len(h) == 20:
                     heads.append(h)
                     h = inpart.read(20)
                 assert not h
                 if heads != op.repo.heads():
                     raise exchange.PushRaced()