##// END OF EJS Templates
safehasattr: pass attribute name as string instead of bytes...
marmoute -
r51509:7200a9d4 default
parent child Browse files
Show More
@@ -1,2669 +1,2669 b''
1 1 # bundle2.py - generic container format to transmit arbitrary data.
2 2 #
3 3 # Copyright 2013 Facebook, Inc.
4 4 #
5 5 # This software may be used and distributed according to the terms of the
6 6 # GNU General Public License version 2 or any later version.
7 7 """Handling of the new bundle2 format
8 8
9 9 The goal of bundle2 is to act as an atomically packet to transmit a set of
10 10 payloads in an application agnostic way. It consist in a sequence of "parts"
11 11 that will be handed to and processed by the application layer.
12 12
13 13
14 14 General format architecture
15 15 ===========================
16 16
17 17 The format is architectured as follow
18 18
19 19 - magic string
20 20 - stream level parameters
21 21 - payload parts (any number)
22 22 - end of stream marker.
23 23
24 24 the Binary format
25 25 ============================
26 26
27 27 All numbers are unsigned and big-endian.
28 28
29 29 stream level parameters
30 30 ------------------------
31 31
32 32 Binary format is as follow
33 33
34 34 :params size: int32
35 35
36 36 The total number of Bytes used by the parameters
37 37
38 38 :params value: arbitrary number of Bytes
39 39
40 40 A blob of `params size` containing the serialized version of all stream level
41 41 parameters.
42 42
43 43 The blob contains a space separated list of parameters. Parameters with value
44 44 are stored in the form `<name>=<value>`. Both name and value are urlquoted.
45 45
46 46 Empty name are obviously forbidden.
47 47
48 48 Name MUST start with a letter. If this first letter is lower case, the
49 49 parameter is advisory and can be safely ignored. However when the first
50 50 letter is capital, the parameter is mandatory and the bundling process MUST
51 51 stop if he is not able to proceed it.
52 52
53 53 Stream parameters use a simple textual format for two main reasons:
54 54
55 55 - Stream level parameters should remain simple and we want to discourage any
56 56 crazy usage.
57 57 - Textual data allow easy human inspection of a bundle2 header in case of
58 58 troubles.
59 59
60 60 Any Applicative level options MUST go into a bundle2 part instead.
61 61
62 62 Payload part
63 63 ------------------------
64 64
65 65 Binary format is as follow
66 66
67 67 :header size: int32
68 68
69 69 The total number of Bytes used by the part header. When the header is empty
70 70 (size = 0) this is interpreted as the end of stream marker.
71 71
72 72 :header:
73 73
74 74 The header defines how to interpret the part. It contains two piece of
75 75 data: the part type, and the part parameters.
76 76
77 77 The part type is used to route an application level handler, that can
78 78 interpret payload.
79 79
80 80 Part parameters are passed to the application level handler. They are
81 81 meant to convey information that will help the application level object to
82 82 interpret the part payload.
83 83
84 84 The binary format of the header is has follow
85 85
86 86 :typesize: (one byte)
87 87
88 88 :parttype: alphanumerical part name (restricted to [a-zA-Z0-9_:-]*)
89 89
90 90 :partid: A 32bits integer (unique in the bundle) that can be used to refer
91 91 to this part.
92 92
93 93 :parameters:
94 94
95 95 Part's parameter may have arbitrary content, the binary structure is::
96 96
97 97 <mandatory-count><advisory-count><param-sizes><param-data>
98 98
99 99 :mandatory-count: 1 byte, number of mandatory parameters
100 100
101 101 :advisory-count: 1 byte, number of advisory parameters
102 102
103 103 :param-sizes:
104 104
105 105 N couple of bytes, where N is the total number of parameters. Each
106 106 couple contains (<size-of-key>, <size-of-value) for one parameter.
107 107
108 108 :param-data:
109 109
110 110 A blob of bytes from which each parameter key and value can be
111 111 retrieved using the list of size couples stored in the previous
112 112 field.
113 113
114 114 Mandatory parameters comes first, then the advisory ones.
115 115
116 116 Each parameter's key MUST be unique within the part.
117 117
118 118 :payload:
119 119
120 120 payload is a series of `<chunksize><chunkdata>`.
121 121
122 122 `chunksize` is an int32, `chunkdata` are plain bytes (as much as
123 123 `chunksize` says)` The payload part is concluded by a zero size chunk.
124 124
125 125 The current implementation always produces either zero or one chunk.
126 126 This is an implementation limitation that will ultimately be lifted.
127 127
128 128 `chunksize` can be negative to trigger special case processing. No such
129 129 processing is in place yet.
130 130
131 131 Bundle processing
132 132 ============================
133 133
134 134 Each part is processed in order using a "part handler". Handler are registered
135 135 for a certain part type.
136 136
137 137 The matching of a part to its handler is case insensitive. The case of the
138 138 part type is used to know if a part is mandatory or advisory. If the Part type
139 139 contains any uppercase char it is considered mandatory. When no handler is
140 140 known for a Mandatory part, the process is aborted and an exception is raised.
141 141 If the part is advisory and no handler is known, the part is ignored. When the
142 142 process is aborted, the full bundle is still read from the stream to keep the
143 143 channel usable. But none of the part read from an abort are processed. In the
144 144 future, dropping the stream may become an option for channel we do not care to
145 145 preserve.
146 146 """
147 147
148 148
149 149 import collections
150 150 import errno
151 151 import os
152 152 import re
153 153 import string
154 154 import struct
155 155 import sys
156 156
157 157 from .i18n import _
158 158 from .node import (
159 159 hex,
160 160 short,
161 161 )
162 162 from . import (
163 163 bookmarks,
164 164 changegroup,
165 165 encoding,
166 166 error,
167 167 obsolete,
168 168 phases,
169 169 pushkey,
170 170 pycompat,
171 171 requirements,
172 172 scmutil,
173 173 streamclone,
174 174 tags,
175 175 url,
176 176 util,
177 177 )
178 178 from .utils import (
179 179 stringutil,
180 180 urlutil,
181 181 )
182 182 from .interfaces import repository
183 183
184 184 urlerr = util.urlerr
185 185 urlreq = util.urlreq
186 186
187 187 _pack = struct.pack
188 188 _unpack = struct.unpack
189 189
190 190 _fstreamparamsize = b'>i'
191 191 _fpartheadersize = b'>i'
192 192 _fparttypesize = b'>B'
193 193 _fpartid = b'>I'
194 194 _fpayloadsize = b'>i'
195 195 _fpartparamcount = b'>BB'
196 196
197 197 preferedchunksize = 32768
198 198
199 199 _parttypeforbidden = re.compile(b'[^a-zA-Z0-9_:-]')
200 200
201 201
202 202 def outdebug(ui, message):
203 203 """debug regarding output stream (bundling)"""
204 204 if ui.configbool(b'devel', b'bundle2.debug'):
205 205 ui.debug(b'bundle2-output: %s\n' % message)
206 206
207 207
208 208 def indebug(ui, message):
209 209 """debug on input stream (unbundling)"""
210 210 if ui.configbool(b'devel', b'bundle2.debug'):
211 211 ui.debug(b'bundle2-input: %s\n' % message)
212 212
213 213
214 214 def validateparttype(parttype):
215 215 """raise ValueError if a parttype contains invalid character"""
216 216 if _parttypeforbidden.search(parttype):
217 217 raise ValueError(parttype)
218 218
219 219
220 220 def _makefpartparamsizes(nbparams):
221 221 """return a struct format to read part parameter sizes
222 222
223 223 The number parameters is variable so we need to build that format
224 224 dynamically.
225 225 """
226 226 return b'>' + (b'BB' * nbparams)
227 227
228 228
229 229 parthandlermapping = {}
230 230
231 231
232 232 def parthandler(parttype, params=()):
233 233 """decorator that register a function as a bundle2 part handler
234 234
235 235 eg::
236 236
237 237 @parthandler('myparttype', ('mandatory', 'param', 'handled'))
238 238 def myparttypehandler(...):
239 239 '''process a part of type "my part".'''
240 240 ...
241 241 """
242 242 validateparttype(parttype)
243 243
244 244 def _decorator(func):
245 245 lparttype = parttype.lower() # enforce lower case matching.
246 246 assert lparttype not in parthandlermapping
247 247 parthandlermapping[lparttype] = func
248 248 func.params = frozenset(params)
249 249 return func
250 250
251 251 return _decorator
252 252
253 253
254 254 class unbundlerecords:
255 255 """keep record of what happens during and unbundle
256 256
257 257 New records are added using `records.add('cat', obj)`. Where 'cat' is a
258 258 category of record and obj is an arbitrary object.
259 259
260 260 `records['cat']` will return all entries of this category 'cat'.
261 261
262 262 Iterating on the object itself will yield `('category', obj)` tuples
263 263 for all entries.
264 264
265 265 All iterations happens in chronological order.
266 266 """
267 267
268 268 def __init__(self):
269 269 self._categories = {}
270 270 self._sequences = []
271 271 self._replies = {}
272 272
273 273 def add(self, category, entry, inreplyto=None):
274 274 """add a new record of a given category.
275 275
276 276 The entry can then be retrieved in the list returned by
277 277 self['category']."""
278 278 self._categories.setdefault(category, []).append(entry)
279 279 self._sequences.append((category, entry))
280 280 if inreplyto is not None:
281 281 self.getreplies(inreplyto).add(category, entry)
282 282
283 283 def getreplies(self, partid):
284 284 """get the records that are replies to a specific part"""
285 285 return self._replies.setdefault(partid, unbundlerecords())
286 286
287 287 def __getitem__(self, cat):
288 288 return tuple(self._categories.get(cat, ()))
289 289
290 290 def __iter__(self):
291 291 return iter(self._sequences)
292 292
293 293 def __len__(self):
294 294 return len(self._sequences)
295 295
296 296 def __nonzero__(self):
297 297 return bool(self._sequences)
298 298
299 299 __bool__ = __nonzero__
300 300
301 301
302 302 class bundleoperation:
303 303 """an object that represents a single bundling process
304 304
305 305 Its purpose is to carry unbundle-related objects and states.
306 306
307 307 A new object should be created at the beginning of each bundle processing.
308 308 The object is to be returned by the processing function.
309 309
310 310 The object has very little content now it will ultimately contain:
311 311 * an access to the repo the bundle is applied to,
312 312 * a ui object,
313 313 * a way to retrieve a transaction to add changes to the repo,
314 314 * a way to record the result of processing each part,
315 315 * a way to construct a bundle response when applicable.
316 316 """
317 317
318 318 def __init__(
319 319 self,
320 320 repo,
321 321 transactiongetter,
322 322 captureoutput=True,
323 323 source=b'',
324 324 remote=None,
325 325 ):
326 326 self.repo = repo
327 327 # the peer object who produced this bundle if available
328 328 self.remote = remote
329 329 self.ui = repo.ui
330 330 self.records = unbundlerecords()
331 331 self.reply = None
332 332 self.captureoutput = captureoutput
333 333 self.hookargs = {}
334 334 self._gettransaction = transactiongetter
335 335 # carries value that can modify part behavior
336 336 self.modes = {}
337 337 self.source = source
338 338
339 339 def gettransaction(self):
340 340 transaction = self._gettransaction()
341 341
342 342 if self.hookargs:
343 343 # the ones added to the transaction supercede those added
344 344 # to the operation.
345 345 self.hookargs.update(transaction.hookargs)
346 346 transaction.hookargs = self.hookargs
347 347
348 348 # mark the hookargs as flushed. further attempts to add to
349 349 # hookargs will result in an abort.
350 350 self.hookargs = None
351 351
352 352 return transaction
353 353
354 354 def addhookargs(self, hookargs):
355 355 if self.hookargs is None:
356 356 raise error.ProgrammingError(
357 357 b'attempted to add hookargs to '
358 358 b'operation after transaction started'
359 359 )
360 360 self.hookargs.update(hookargs)
361 361
362 362
363 363 class TransactionUnavailable(RuntimeError):
364 364 pass
365 365
366 366
367 367 def _notransaction():
368 368 """default method to get a transaction while processing a bundle
369 369
370 370 Raise an exception to highlight the fact that no transaction was expected
371 371 to be created"""
372 372 raise TransactionUnavailable()
373 373
374 374
375 375 def applybundle(repo, unbundler, tr, source, url=None, remote=None, **kwargs):
376 376 # transform me into unbundler.apply() as soon as the freeze is lifted
377 377 if isinstance(unbundler, unbundle20):
378 378 tr.hookargs[b'bundle2'] = b'1'
379 379 if source is not None and b'source' not in tr.hookargs:
380 380 tr.hookargs[b'source'] = source
381 381 if url is not None and b'url' not in tr.hookargs:
382 382 tr.hookargs[b'url'] = url
383 383 return processbundle(
384 384 repo, unbundler, lambda: tr, source=source, remote=remote
385 385 )
386 386 else:
387 387 # the transactiongetter won't be used, but we might as well set it
388 388 op = bundleoperation(repo, lambda: tr, source=source, remote=remote)
389 389 _processchangegroup(op, unbundler, tr, source, url, **kwargs)
390 390 return op
391 391
392 392
393 393 class partiterator:
394 394 def __init__(self, repo, op, unbundler):
395 395 self.repo = repo
396 396 self.op = op
397 397 self.unbundler = unbundler
398 398 self.iterator = None
399 399 self.count = 0
400 400 self.current = None
401 401
402 402 def __enter__(self):
403 403 def func():
404 404 itr = enumerate(self.unbundler.iterparts(), 1)
405 405 for count, p in itr:
406 406 self.count = count
407 407 self.current = p
408 408 yield p
409 409 p.consume()
410 410 self.current = None
411 411
412 412 self.iterator = func()
413 413 return self.iterator
414 414
415 415 def __exit__(self, type, exc, tb):
416 416 if not self.iterator:
417 417 return
418 418
419 419 # Only gracefully abort in a normal exception situation. User aborts
420 420 # like Ctrl+C throw a KeyboardInterrupt which is not a base Exception,
421 421 # and should not gracefully cleanup.
422 422 if isinstance(exc, Exception):
423 423 # Any exceptions seeking to the end of the bundle at this point are
424 424 # almost certainly related to the underlying stream being bad.
425 425 # And, chances are that the exception we're handling is related to
426 426 # getting in that bad state. So, we swallow the seeking error and
427 427 # re-raise the original error.
428 428 seekerror = False
429 429 try:
430 430 if self.current:
431 431 # consume the part content to not corrupt the stream.
432 432 self.current.consume()
433 433
434 434 for part in self.iterator:
435 435 # consume the bundle content
436 436 part.consume()
437 437 except Exception:
438 438 seekerror = True
439 439
440 440 # Small hack to let caller code distinguish exceptions from bundle2
441 441 # processing from processing the old format. This is mostly needed
442 442 # to handle different return codes to unbundle according to the type
443 443 # of bundle. We should probably clean up or drop this return code
444 444 # craziness in a future version.
445 445 exc.duringunbundle2 = True
446 446 salvaged = []
447 447 replycaps = None
448 448 if self.op.reply is not None:
449 449 salvaged = self.op.reply.salvageoutput()
450 450 replycaps = self.op.reply.capabilities
451 451 exc._replycaps = replycaps
452 452 exc._bundle2salvagedoutput = salvaged
453 453
454 454 # Re-raising from a variable loses the original stack. So only use
455 455 # that form if we need to.
456 456 if seekerror:
457 457 raise exc
458 458
459 459 self.repo.ui.debug(
460 460 b'bundle2-input-bundle: %i parts total\n' % self.count
461 461 )
462 462
463 463
464 464 def processbundle(
465 465 repo,
466 466 unbundler,
467 467 transactiongetter=None,
468 468 op=None,
469 469 source=b'',
470 470 remote=None,
471 471 ):
472 472 """This function process a bundle, apply effect to/from a repo
473 473
474 474 It iterates over each part then searches for and uses the proper handling
475 475 code to process the part. Parts are processed in order.
476 476
477 477 Unknown Mandatory part will abort the process.
478 478
479 479 It is temporarily possible to provide a prebuilt bundleoperation to the
480 480 function. This is used to ensure output is properly propagated in case of
481 481 an error during the unbundling. This output capturing part will likely be
482 482 reworked and this ability will probably go away in the process.
483 483 """
484 484 if op is None:
485 485 if transactiongetter is None:
486 486 transactiongetter = _notransaction
487 487 op = bundleoperation(
488 488 repo,
489 489 transactiongetter,
490 490 source=source,
491 491 remote=remote,
492 492 )
493 493 # todo:
494 494 # - replace this is a init function soon.
495 495 # - exception catching
496 496 unbundler.params
497 497 if repo.ui.debugflag:
498 498 msg = [b'bundle2-input-bundle:']
499 499 if unbundler.params:
500 500 msg.append(b' %i params' % len(unbundler.params))
501 501 if op._gettransaction is None or op._gettransaction is _notransaction:
502 502 msg.append(b' no-transaction')
503 503 else:
504 504 msg.append(b' with-transaction')
505 505 msg.append(b'\n')
506 506 repo.ui.debug(b''.join(msg))
507 507
508 508 processparts(repo, op, unbundler)
509 509
510 510 return op
511 511
512 512
513 513 def processparts(repo, op, unbundler):
514 514 with partiterator(repo, op, unbundler) as parts:
515 515 for part in parts:
516 516 _processpart(op, part)
517 517
518 518
519 519 def _processchangegroup(op, cg, tr, source, url, **kwargs):
520 520 if op.remote is not None and op.remote.path is not None:
521 521 remote_path = op.remote.path
522 522 kwargs = kwargs.copy()
523 523 kwargs['delta_base_reuse_policy'] = remote_path.delta_reuse_policy
524 524 ret = cg.apply(op.repo, tr, source, url, **kwargs)
525 525 op.records.add(
526 526 b'changegroup',
527 527 {
528 528 b'return': ret,
529 529 },
530 530 )
531 531 return ret
532 532
533 533
534 534 def _gethandler(op, part):
535 535 status = b'unknown' # used by debug output
536 536 try:
537 537 handler = parthandlermapping.get(part.type)
538 538 if handler is None:
539 539 status = b'unsupported-type'
540 540 raise error.BundleUnknownFeatureError(parttype=part.type)
541 541 indebug(op.ui, b'found a handler for part %s' % part.type)
542 542 unknownparams = part.mandatorykeys - handler.params
543 543 if unknownparams:
544 544 unknownparams = list(unknownparams)
545 545 unknownparams.sort()
546 546 status = b'unsupported-params (%s)' % b', '.join(unknownparams)
547 547 raise error.BundleUnknownFeatureError(
548 548 parttype=part.type, params=unknownparams
549 549 )
550 550 status = b'supported'
551 551 except error.BundleUnknownFeatureError as exc:
552 552 if part.mandatory: # mandatory parts
553 553 raise
554 554 indebug(op.ui, b'ignoring unsupported advisory part %s' % exc)
555 555 return # skip to part processing
556 556 finally:
557 557 if op.ui.debugflag:
558 558 msg = [b'bundle2-input-part: "%s"' % part.type]
559 559 if not part.mandatory:
560 560 msg.append(b' (advisory)')
561 561 nbmp = len(part.mandatorykeys)
562 562 nbap = len(part.params) - nbmp
563 563 if nbmp or nbap:
564 564 msg.append(b' (params:')
565 565 if nbmp:
566 566 msg.append(b' %i mandatory' % nbmp)
567 567 if nbap:
568 568 msg.append(b' %i advisory' % nbmp)
569 569 msg.append(b')')
570 570 msg.append(b' %s\n' % status)
571 571 op.ui.debug(b''.join(msg))
572 572
573 573 return handler
574 574
575 575
576 576 def _processpart(op, part):
577 577 """process a single part from a bundle
578 578
579 579 The part is guaranteed to have been fully consumed when the function exits
580 580 (even if an exception is raised)."""
581 581 handler = _gethandler(op, part)
582 582 if handler is None:
583 583 return
584 584
585 585 # handler is called outside the above try block so that we don't
586 586 # risk catching KeyErrors from anything other than the
587 587 # parthandlermapping lookup (any KeyError raised by handler()
588 588 # itself represents a defect of a different variety).
589 589 output = None
590 590 if op.captureoutput and op.reply is not None:
591 591 op.ui.pushbuffer(error=True, subproc=True)
592 592 output = b''
593 593 try:
594 594 handler(op, part)
595 595 finally:
596 596 if output is not None:
597 597 output = op.ui.popbuffer()
598 598 if output:
599 599 outpart = op.reply.newpart(b'output', data=output, mandatory=False)
600 600 outpart.addparam(
601 601 b'in-reply-to', pycompat.bytestr(part.id), mandatory=False
602 602 )
603 603
604 604
605 605 def decodecaps(blob):
606 606 """decode a bundle2 caps bytes blob into a dictionary
607 607
608 608 The blob is a list of capabilities (one per line)
609 609 Capabilities may have values using a line of the form::
610 610
611 611 capability=value1,value2,value3
612 612
613 613 The values are always a list."""
614 614 caps = {}
615 615 for line in blob.splitlines():
616 616 if not line:
617 617 continue
618 618 if b'=' not in line:
619 619 key, vals = line, ()
620 620 else:
621 621 key, vals = line.split(b'=', 1)
622 622 vals = vals.split(b',')
623 623 key = urlreq.unquote(key)
624 624 vals = [urlreq.unquote(v) for v in vals]
625 625 caps[key] = vals
626 626 return caps
627 627
628 628
629 629 def encodecaps(caps):
630 630 """encode a bundle2 caps dictionary into a bytes blob"""
631 631 chunks = []
632 632 for ca in sorted(caps):
633 633 vals = caps[ca]
634 634 ca = urlreq.quote(ca)
635 635 vals = [urlreq.quote(v) for v in vals]
636 636 if vals:
637 637 ca = b"%s=%s" % (ca, b','.join(vals))
638 638 chunks.append(ca)
639 639 return b'\n'.join(chunks)
640 640
641 641
642 642 bundletypes = {
643 643 b"": (b"", b'UN'), # only when using unbundle on ssh and old http servers
644 644 # since the unification ssh accepts a header but there
645 645 # is no capability signaling it.
646 646 b"HG20": (), # special-cased below
647 647 b"HG10UN": (b"HG10UN", b'UN'),
648 648 b"HG10BZ": (b"HG10", b'BZ'),
649 649 b"HG10GZ": (b"HG10GZ", b'GZ'),
650 650 }
651 651
652 652 # hgweb uses this list to communicate its preferred type
653 653 bundlepriority = [b'HG10GZ', b'HG10BZ', b'HG10UN']
654 654
655 655
656 656 class bundle20:
657 657 """represent an outgoing bundle2 container
658 658
659 659 Use the `addparam` method to add stream level parameter. and `newpart` to
660 660 populate it. Then call `getchunks` to retrieve all the binary chunks of
661 661 data that compose the bundle2 container."""
662 662
663 663 _magicstring = b'HG20'
664 664
665 665 def __init__(self, ui, capabilities=()):
666 666 self.ui = ui
667 667 self._params = []
668 668 self._parts = []
669 669 self.capabilities = dict(capabilities)
670 670 self._compengine = util.compengines.forbundletype(b'UN')
671 671 self._compopts = None
672 672 # If compression is being handled by a consumer of the raw
673 673 # data (e.g. the wire protocol), unsetting this flag tells
674 674 # consumers that the bundle is best left uncompressed.
675 675 self.prefercompressed = True
676 676
677 677 def setcompression(self, alg, compopts=None):
678 678 """setup core part compression to <alg>"""
679 679 if alg in (None, b'UN'):
680 680 return
681 681 assert not any(n.lower() == b'compression' for n, v in self._params)
682 682 self.addparam(b'Compression', alg)
683 683 self._compengine = util.compengines.forbundletype(alg)
684 684 self._compopts = compopts
685 685
686 686 @property
687 687 def nbparts(self):
688 688 """total number of parts added to the bundler"""
689 689 return len(self._parts)
690 690
691 691 # methods used to defines the bundle2 content
692 692 def addparam(self, name, value=None):
693 693 """add a stream level parameter"""
694 694 if not name:
695 695 raise error.ProgrammingError(b'empty parameter name')
696 696 if name[0:1] not in pycompat.bytestr(
697 697 string.ascii_letters # pytype: disable=wrong-arg-types
698 698 ):
699 699 raise error.ProgrammingError(
700 700 b'non letter first character: %s' % name
701 701 )
702 702 self._params.append((name, value))
703 703
704 704 def addpart(self, part):
705 705 """add a new part to the bundle2 container
706 706
707 707 Parts contains the actual applicative payload."""
708 708 assert part.id is None
709 709 part.id = len(self._parts) # very cheap counter
710 710 self._parts.append(part)
711 711
712 712 def newpart(self, typeid, *args, **kwargs):
713 713 """create a new part and add it to the containers
714 714
715 715 As the part is directly added to the containers. For now, this means
716 716 that any failure to properly initialize the part after calling
717 717 ``newpart`` should result in a failure of the whole bundling process.
718 718
719 719 You can still fall back to manually create and add if you need better
720 720 control."""
721 721 part = bundlepart(typeid, *args, **kwargs)
722 722 self.addpart(part)
723 723 return part
724 724
725 725 # methods used to generate the bundle2 stream
726 726 def getchunks(self):
727 727 if self.ui.debugflag:
728 728 msg = [b'bundle2-output-bundle: "%s",' % self._magicstring]
729 729 if self._params:
730 730 msg.append(b' (%i params)' % len(self._params))
731 731 msg.append(b' %i parts total\n' % len(self._parts))
732 732 self.ui.debug(b''.join(msg))
733 733 outdebug(self.ui, b'start emission of %s stream' % self._magicstring)
734 734 yield self._magicstring
735 735 param = self._paramchunk()
736 736 outdebug(self.ui, b'bundle parameter: %s' % param)
737 737 yield _pack(_fstreamparamsize, len(param))
738 738 if param:
739 739 yield param
740 740 for chunk in self._compengine.compressstream(
741 741 self._getcorechunk(), self._compopts
742 742 ):
743 743 yield chunk
744 744
745 745 def _paramchunk(self):
746 746 """return a encoded version of all stream parameters"""
747 747 blocks = []
748 748 for par, value in self._params:
749 749 par = urlreq.quote(par)
750 750 if value is not None:
751 751 value = urlreq.quote(value)
752 752 par = b'%s=%s' % (par, value)
753 753 blocks.append(par)
754 754 return b' '.join(blocks)
755 755
756 756 def _getcorechunk(self):
757 757 """yield chunk for the core part of the bundle
758 758
759 759 (all but headers and parameters)"""
760 760 outdebug(self.ui, b'start of parts')
761 761 for part in self._parts:
762 762 outdebug(self.ui, b'bundle part: "%s"' % part.type)
763 763 for chunk in part.getchunks(ui=self.ui):
764 764 yield chunk
765 765 outdebug(self.ui, b'end of bundle')
766 766 yield _pack(_fpartheadersize, 0)
767 767
768 768 def salvageoutput(self):
769 769 """return a list with a copy of all output parts in the bundle
770 770
771 771 This is meant to be used during error handling to make sure we preserve
772 772 server output"""
773 773 salvaged = []
774 774 for part in self._parts:
775 775 if part.type.startswith(b'output'):
776 776 salvaged.append(part.copy())
777 777 return salvaged
778 778
779 779
780 780 class unpackermixin:
781 781 """A mixin to extract bytes and struct data from a stream"""
782 782
783 783 def __init__(self, fp):
784 784 self._fp = fp
785 785
786 786 def _unpack(self, format):
787 787 """unpack this struct format from the stream
788 788
789 789 This method is meant for internal usage by the bundle2 protocol only.
790 790 They directly manipulate the low level stream including bundle2 level
791 791 instruction.
792 792
793 793 Do not use it to implement higher-level logic or methods."""
794 794 data = self._readexact(struct.calcsize(format))
795 795 return _unpack(format, data)
796 796
797 797 def _readexact(self, size):
798 798 """read exactly <size> bytes from the stream
799 799
800 800 This method is meant for internal usage by the bundle2 protocol only.
801 801 They directly manipulate the low level stream including bundle2 level
802 802 instruction.
803 803
804 804 Do not use it to implement higher-level logic or methods."""
805 805 return changegroup.readexactly(self._fp, size)
806 806
807 807
808 808 def getunbundler(ui, fp, magicstring=None):
809 809 """return a valid unbundler object for a given magicstring"""
810 810 if magicstring is None:
811 811 magicstring = changegroup.readexactly(fp, 4)
812 812 magic, version = magicstring[0:2], magicstring[2:4]
813 813 if magic != b'HG':
814 814 ui.debug(
815 815 b"error: invalid magic: %r (version %r), should be 'HG'\n"
816 816 % (magic, version)
817 817 )
818 818 raise error.Abort(_(b'not a Mercurial bundle'))
819 819 unbundlerclass = formatmap.get(version)
820 820 if unbundlerclass is None:
821 821 raise error.Abort(_(b'unknown bundle version %s') % version)
822 822 unbundler = unbundlerclass(ui, fp)
823 823 indebug(ui, b'start processing of %s stream' % magicstring)
824 824 return unbundler
825 825
826 826
827 827 class unbundle20(unpackermixin):
828 828 """interpret a bundle2 stream
829 829
830 830 This class is fed with a binary stream and yields parts through its
831 831 `iterparts` methods."""
832 832
833 833 _magicstring = b'HG20'
834 834
835 835 def __init__(self, ui, fp):
836 836 """If header is specified, we do not read it out of the stream."""
837 837 self.ui = ui
838 838 self._compengine = util.compengines.forbundletype(b'UN')
839 839 self._compressed = None
840 840 super(unbundle20, self).__init__(fp)
841 841
842 842 @util.propertycache
843 843 def params(self):
844 844 """dictionary of stream level parameters"""
845 845 indebug(self.ui, b'reading bundle2 stream parameters')
846 846 params = {}
847 847 paramssize = self._unpack(_fstreamparamsize)[0]
848 848 if paramssize < 0:
849 849 raise error.BundleValueError(
850 850 b'negative bundle param size: %i' % paramssize
851 851 )
852 852 if paramssize:
853 853 params = self._readexact(paramssize)
854 854 params = self._processallparams(params)
855 855 return params
856 856
857 857 def _processallparams(self, paramsblock):
858 858 """ """
859 859 params = util.sortdict()
860 860 for p in paramsblock.split(b' '):
861 861 p = p.split(b'=', 1)
862 862 p = [urlreq.unquote(i) for i in p]
863 863 if len(p) < 2:
864 864 p.append(None)
865 865 self._processparam(*p)
866 866 params[p[0]] = p[1]
867 867 return params
868 868
869 869 def _processparam(self, name, value):
870 870 """process a parameter, applying its effect if needed
871 871
872 872 Parameter starting with a lower case letter are advisory and will be
873 873 ignored when unknown. Those starting with an upper case letter are
874 874 mandatory and will this function will raise a KeyError when unknown.
875 875
876 876 Note: no option are currently supported. Any input will be either
877 877 ignored or failing.
878 878 """
879 879 if not name:
880 880 raise ValueError('empty parameter name')
881 881 if name[0:1] not in pycompat.bytestr(
882 882 string.ascii_letters # pytype: disable=wrong-arg-types
883 883 ):
884 884 raise ValueError('non letter first character: %s' % name)
885 885 try:
886 886 handler = b2streamparamsmap[name.lower()]
887 887 except KeyError:
888 888 if name[0:1].islower():
889 889 indebug(self.ui, b"ignoring unknown parameter %s" % name)
890 890 else:
891 891 raise error.BundleUnknownFeatureError(params=(name,))
892 892 else:
893 893 handler(self, name, value)
894 894
895 895 def _forwardchunks(self):
896 896 """utility to transfer a bundle2 as binary
897 897
898 898 This is made necessary by the fact the 'getbundle' command over 'ssh'
899 899 have no way to know then the reply end, relying on the bundle to be
900 900 interpreted to know its end. This is terrible and we are sorry, but we
901 901 needed to move forward to get general delta enabled.
902 902 """
903 903 yield self._magicstring
904 904 assert 'params' not in vars(self)
905 905 paramssize = self._unpack(_fstreamparamsize)[0]
906 906 if paramssize < 0:
907 907 raise error.BundleValueError(
908 908 b'negative bundle param size: %i' % paramssize
909 909 )
910 910 if paramssize:
911 911 params = self._readexact(paramssize)
912 912 self._processallparams(params)
913 913 # The payload itself is decompressed below, so drop
914 914 # the compression parameter passed down to compensate.
915 915 outparams = []
916 916 for p in params.split(b' '):
917 917 k, v = p.split(b'=', 1)
918 918 if k.lower() != b'compression':
919 919 outparams.append(p)
920 920 outparams = b' '.join(outparams)
921 921 yield _pack(_fstreamparamsize, len(outparams))
922 922 yield outparams
923 923 else:
924 924 yield _pack(_fstreamparamsize, paramssize)
925 925 # From there, payload might need to be decompressed
926 926 self._fp = self._compengine.decompressorreader(self._fp)
927 927 emptycount = 0
928 928 while emptycount < 2:
929 929 # so we can brainlessly loop
930 930 assert _fpartheadersize == _fpayloadsize
931 931 size = self._unpack(_fpartheadersize)[0]
932 932 yield _pack(_fpartheadersize, size)
933 933 if size:
934 934 emptycount = 0
935 935 else:
936 936 emptycount += 1
937 937 continue
938 938 if size == flaginterrupt:
939 939 continue
940 940 elif size < 0:
941 941 raise error.BundleValueError(b'negative chunk size: %i')
942 942 yield self._readexact(size)
943 943
944 944 def iterparts(self, seekable=False):
945 945 """yield all parts contained in the stream"""
946 946 cls = seekableunbundlepart if seekable else unbundlepart
947 947 # make sure param have been loaded
948 948 self.params
949 949 # From there, payload need to be decompressed
950 950 self._fp = self._compengine.decompressorreader(self._fp)
951 951 indebug(self.ui, b'start extraction of bundle2 parts')
952 952 headerblock = self._readpartheader()
953 953 while headerblock is not None:
954 954 part = cls(self.ui, headerblock, self._fp)
955 955 yield part
956 956 # Ensure part is fully consumed so we can start reading the next
957 957 # part.
958 958 part.consume()
959 959
960 960 headerblock = self._readpartheader()
961 961 indebug(self.ui, b'end of bundle2 stream')
962 962
963 963 def _readpartheader(self):
964 964 """reads a part header size and return the bytes blob
965 965
966 966 returns None if empty"""
967 967 headersize = self._unpack(_fpartheadersize)[0]
968 968 if headersize < 0:
969 969 raise error.BundleValueError(
970 970 b'negative part header size: %i' % headersize
971 971 )
972 972 indebug(self.ui, b'part header size: %i' % headersize)
973 973 if headersize:
974 974 return self._readexact(headersize)
975 975 return None
976 976
977 977 def compressed(self):
978 978 self.params # load params
979 979 return self._compressed
980 980
981 981 def close(self):
982 982 """close underlying file"""
983 983 if util.safehasattr(self._fp, 'close'):
984 984 return self._fp.close()
985 985
986 986
987 987 formatmap = {b'20': unbundle20}
988 988
989 989 b2streamparamsmap = {}
990 990
991 991
992 992 def b2streamparamhandler(name):
993 993 """register a handler for a stream level parameter"""
994 994
995 995 def decorator(func):
996 996 assert name not in formatmap
997 997 b2streamparamsmap[name] = func
998 998 return func
999 999
1000 1000 return decorator
1001 1001
1002 1002
1003 1003 @b2streamparamhandler(b'compression')
1004 1004 def processcompression(unbundler, param, value):
1005 1005 """read compression parameter and install payload decompression"""
1006 1006 if value not in util.compengines.supportedbundletypes:
1007 1007 raise error.BundleUnknownFeatureError(params=(param,), values=(value,))
1008 1008 unbundler._compengine = util.compengines.forbundletype(value)
1009 1009 if value is not None:
1010 1010 unbundler._compressed = True
1011 1011
1012 1012
1013 1013 class bundlepart:
1014 1014 """A bundle2 part contains application level payload
1015 1015
1016 1016 The part `type` is used to route the part to the application level
1017 1017 handler.
1018 1018
1019 1019 The part payload is contained in ``part.data``. It could be raw bytes or a
1020 1020 generator of byte chunks.
1021 1021
1022 1022 You can add parameters to the part using the ``addparam`` method.
1023 1023 Parameters can be either mandatory (default) or advisory. Remote side
1024 1024 should be able to safely ignore the advisory ones.
1025 1025
1026 1026 Both data and parameters cannot be modified after the generation has begun.
1027 1027 """
1028 1028
1029 1029 def __init__(
1030 1030 self,
1031 1031 parttype,
1032 1032 mandatoryparams=(),
1033 1033 advisoryparams=(),
1034 1034 data=b'',
1035 1035 mandatory=True,
1036 1036 ):
1037 1037 validateparttype(parttype)
1038 1038 self.id = None
1039 1039 self.type = parttype
1040 1040 self._data = data
1041 1041 self._mandatoryparams = list(mandatoryparams)
1042 1042 self._advisoryparams = list(advisoryparams)
1043 1043 # checking for duplicated entries
1044 1044 self._seenparams = set()
1045 1045 for pname, __ in self._mandatoryparams + self._advisoryparams:
1046 1046 if pname in self._seenparams:
1047 1047 raise error.ProgrammingError(b'duplicated params: %s' % pname)
1048 1048 self._seenparams.add(pname)
1049 1049 # status of the part's generation:
1050 1050 # - None: not started,
1051 1051 # - False: currently generated,
1052 1052 # - True: generation done.
1053 1053 self._generated = None
1054 1054 self.mandatory = mandatory
1055 1055
1056 1056 def __repr__(self):
1057 1057 cls = "%s.%s" % (self.__class__.__module__, self.__class__.__name__)
1058 1058 return '<%s object at %x; id: %s; type: %s; mandatory: %s>' % (
1059 1059 cls,
1060 1060 id(self),
1061 1061 self.id,
1062 1062 self.type,
1063 1063 self.mandatory,
1064 1064 )
1065 1065
1066 1066 def copy(self):
1067 1067 """return a copy of the part
1068 1068
1069 1069 The new part have the very same content but no partid assigned yet.
1070 1070 Parts with generated data cannot be copied."""
1071 1071 assert not util.safehasattr(self.data, 'next')
1072 1072 return self.__class__(
1073 1073 self.type,
1074 1074 self._mandatoryparams,
1075 1075 self._advisoryparams,
1076 1076 self._data,
1077 1077 self.mandatory,
1078 1078 )
1079 1079
1080 1080 # methods used to defines the part content
1081 1081 @property
1082 1082 def data(self):
1083 1083 return self._data
1084 1084
1085 1085 @data.setter
1086 1086 def data(self, data):
1087 1087 if self._generated is not None:
1088 1088 raise error.ReadOnlyPartError(b'part is being generated')
1089 1089 self._data = data
1090 1090
1091 1091 @property
1092 1092 def mandatoryparams(self):
1093 1093 # make it an immutable tuple to force people through ``addparam``
1094 1094 return tuple(self._mandatoryparams)
1095 1095
1096 1096 @property
1097 1097 def advisoryparams(self):
1098 1098 # make it an immutable tuple to force people through ``addparam``
1099 1099 return tuple(self._advisoryparams)
1100 1100
1101 1101 def addparam(self, name, value=b'', mandatory=True):
1102 1102 """add a parameter to the part
1103 1103
1104 1104 If 'mandatory' is set to True, the remote handler must claim support
1105 1105 for this parameter or the unbundling will be aborted.
1106 1106
1107 1107 The 'name' and 'value' cannot exceed 255 bytes each.
1108 1108 """
1109 1109 if self._generated is not None:
1110 1110 raise error.ReadOnlyPartError(b'part is being generated')
1111 1111 if name in self._seenparams:
1112 1112 raise ValueError(b'duplicated params: %s' % name)
1113 1113 self._seenparams.add(name)
1114 1114 params = self._advisoryparams
1115 1115 if mandatory:
1116 1116 params = self._mandatoryparams
1117 1117 params.append((name, value))
1118 1118
1119 1119 # methods used to generates the bundle2 stream
1120 1120 def getchunks(self, ui):
1121 1121 if self._generated is not None:
1122 1122 raise error.ProgrammingError(b'part can only be consumed once')
1123 1123 self._generated = False
1124 1124
1125 1125 if ui.debugflag:
1126 1126 msg = [b'bundle2-output-part: "%s"' % self.type]
1127 1127 if not self.mandatory:
1128 1128 msg.append(b' (advisory)')
1129 1129 nbmp = len(self.mandatoryparams)
1130 1130 nbap = len(self.advisoryparams)
1131 1131 if nbmp or nbap:
1132 1132 msg.append(b' (params:')
1133 1133 if nbmp:
1134 1134 msg.append(b' %i mandatory' % nbmp)
1135 1135 if nbap:
1136 1136 msg.append(b' %i advisory' % nbmp)
1137 1137 msg.append(b')')
1138 1138 if not self.data:
1139 1139 msg.append(b' empty payload')
1140 1140 elif util.safehasattr(self.data, 'next') or util.safehasattr(
1141 1141 self.data, b'__next__'
1142 1142 ):
1143 1143 msg.append(b' streamed payload')
1144 1144 else:
1145 1145 msg.append(b' %i bytes payload' % len(self.data))
1146 1146 msg.append(b'\n')
1147 1147 ui.debug(b''.join(msg))
1148 1148
1149 1149 #### header
1150 1150 if self.mandatory:
1151 1151 parttype = self.type.upper()
1152 1152 else:
1153 1153 parttype = self.type.lower()
1154 1154 outdebug(ui, b'part %s: "%s"' % (pycompat.bytestr(self.id), parttype))
1155 1155 ## parttype
1156 1156 header = [
1157 1157 _pack(_fparttypesize, len(parttype)),
1158 1158 parttype,
1159 1159 _pack(_fpartid, self.id),
1160 1160 ]
1161 1161 ## parameters
1162 1162 # count
1163 1163 manpar = self.mandatoryparams
1164 1164 advpar = self.advisoryparams
1165 1165 header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
1166 1166 # size
1167 1167 parsizes = []
1168 1168 for key, value in manpar:
1169 1169 parsizes.append(len(key))
1170 1170 parsizes.append(len(value))
1171 1171 for key, value in advpar:
1172 1172 parsizes.append(len(key))
1173 1173 parsizes.append(len(value))
1174 1174 paramsizes = _pack(_makefpartparamsizes(len(parsizes) // 2), *parsizes)
1175 1175 header.append(paramsizes)
1176 1176 # key, value
1177 1177 for key, value in manpar:
1178 1178 header.append(key)
1179 1179 header.append(value)
1180 1180 for key, value in advpar:
1181 1181 header.append(key)
1182 1182 header.append(value)
1183 1183 ## finalize header
1184 1184 try:
1185 1185 headerchunk = b''.join(header)
1186 1186 except TypeError:
1187 1187 raise TypeError(
1188 1188 'Found a non-bytes trying to '
1189 1189 'build bundle part header: %r' % header
1190 1190 )
1191 1191 outdebug(ui, b'header chunk size: %i' % len(headerchunk))
1192 1192 yield _pack(_fpartheadersize, len(headerchunk))
1193 1193 yield headerchunk
1194 1194 ## payload
1195 1195 try:
1196 1196 for chunk in self._payloadchunks():
1197 1197 outdebug(ui, b'payload chunk size: %i' % len(chunk))
1198 1198 yield _pack(_fpayloadsize, len(chunk))
1199 1199 yield chunk
1200 1200 except GeneratorExit:
1201 1201 # GeneratorExit means that nobody is listening for our
1202 1202 # results anyway, so just bail quickly rather than trying
1203 1203 # to produce an error part.
1204 1204 ui.debug(b'bundle2-generatorexit\n')
1205 1205 raise
1206 1206 except BaseException as exc:
1207 1207 bexc = stringutil.forcebytestr(exc)
1208 1208 # backup exception data for later
1209 1209 ui.debug(
1210 1210 b'bundle2-input-stream-interrupt: encoding exception %s' % bexc
1211 1211 )
1212 1212 tb = sys.exc_info()[2]
1213 1213 msg = b'unexpected error: %s' % bexc
1214 1214 interpart = bundlepart(
1215 1215 b'error:abort', [(b'message', msg)], mandatory=False
1216 1216 )
1217 1217 interpart.id = 0
1218 1218 yield _pack(_fpayloadsize, -1)
1219 1219 for chunk in interpart.getchunks(ui=ui):
1220 1220 yield chunk
1221 1221 outdebug(ui, b'closing payload chunk')
1222 1222 # abort current part payload
1223 1223 yield _pack(_fpayloadsize, 0)
1224 1224 pycompat.raisewithtb(exc, tb)
1225 1225 # end of payload
1226 1226 outdebug(ui, b'closing payload chunk')
1227 1227 yield _pack(_fpayloadsize, 0)
1228 1228 self._generated = True
1229 1229
1230 1230 def _payloadchunks(self):
1231 1231 """yield chunks of a the part payload
1232 1232
1233 1233 Exists to handle the different methods to provide data to a part."""
1234 1234 # we only support fixed size data now.
1235 1235 # This will be improved in the future.
1236 1236 if util.safehasattr(self.data, 'next') or util.safehasattr(
1237 1237 self.data, '__next__'
1238 1238 ):
1239 1239 buff = util.chunkbuffer(self.data)
1240 1240 chunk = buff.read(preferedchunksize)
1241 1241 while chunk:
1242 1242 yield chunk
1243 1243 chunk = buff.read(preferedchunksize)
1244 1244 elif len(self.data):
1245 1245 yield self.data
1246 1246
1247 1247
1248 1248 flaginterrupt = -1
1249 1249
1250 1250
1251 1251 class interrupthandler(unpackermixin):
1252 1252 """read one part and process it with restricted capability
1253 1253
1254 1254 This allows to transmit exception raised on the producer size during part
1255 1255 iteration while the consumer is reading a part.
1256 1256
1257 1257 Part processed in this manner only have access to a ui object,"""
1258 1258
1259 1259 def __init__(self, ui, fp):
1260 1260 super(interrupthandler, self).__init__(fp)
1261 1261 self.ui = ui
1262 1262
1263 1263 def _readpartheader(self):
1264 1264 """reads a part header size and return the bytes blob
1265 1265
1266 1266 returns None if empty"""
1267 1267 headersize = self._unpack(_fpartheadersize)[0]
1268 1268 if headersize < 0:
1269 1269 raise error.BundleValueError(
1270 1270 b'negative part header size: %i' % headersize
1271 1271 )
1272 1272 indebug(self.ui, b'part header size: %i\n' % headersize)
1273 1273 if headersize:
1274 1274 return self._readexact(headersize)
1275 1275 return None
1276 1276
1277 1277 def __call__(self):
1278 1278
1279 1279 self.ui.debug(
1280 1280 b'bundle2-input-stream-interrupt: opening out of band context\n'
1281 1281 )
1282 1282 indebug(self.ui, b'bundle2 stream interruption, looking for a part.')
1283 1283 headerblock = self._readpartheader()
1284 1284 if headerblock is None:
1285 1285 indebug(self.ui, b'no part found during interruption.')
1286 1286 return
1287 1287 part = unbundlepart(self.ui, headerblock, self._fp)
1288 1288 op = interruptoperation(self.ui)
1289 1289 hardabort = False
1290 1290 try:
1291 1291 _processpart(op, part)
1292 1292 except (SystemExit, KeyboardInterrupt):
1293 1293 hardabort = True
1294 1294 raise
1295 1295 finally:
1296 1296 if not hardabort:
1297 1297 part.consume()
1298 1298 self.ui.debug(
1299 1299 b'bundle2-input-stream-interrupt: closing out of band context\n'
1300 1300 )
1301 1301
1302 1302
1303 1303 class interruptoperation:
1304 1304 """A limited operation to be use by part handler during interruption
1305 1305
1306 1306 It only have access to an ui object.
1307 1307 """
1308 1308
1309 1309 def __init__(self, ui):
1310 1310 self.ui = ui
1311 1311 self.reply = None
1312 1312 self.captureoutput = False
1313 1313
1314 1314 @property
1315 1315 def repo(self):
1316 1316 raise error.ProgrammingError(b'no repo access from stream interruption')
1317 1317
1318 1318 def gettransaction(self):
1319 1319 raise TransactionUnavailable(b'no repo access from stream interruption')
1320 1320
1321 1321
1322 1322 def decodepayloadchunks(ui, fh):
1323 1323 """Reads bundle2 part payload data into chunks.
1324 1324
1325 1325 Part payload data consists of framed chunks. This function takes
1326 1326 a file handle and emits those chunks.
1327 1327 """
1328 1328 dolog = ui.configbool(b'devel', b'bundle2.debug')
1329 1329 debug = ui.debug
1330 1330
1331 1331 headerstruct = struct.Struct(_fpayloadsize)
1332 1332 headersize = headerstruct.size
1333 1333 unpack = headerstruct.unpack
1334 1334
1335 1335 readexactly = changegroup.readexactly
1336 1336 read = fh.read
1337 1337
1338 1338 chunksize = unpack(readexactly(fh, headersize))[0]
1339 1339 indebug(ui, b'payload chunk size: %i' % chunksize)
1340 1340
1341 1341 # changegroup.readexactly() is inlined below for performance.
1342 1342 while chunksize:
1343 1343 if chunksize >= 0:
1344 1344 s = read(chunksize)
1345 1345 if len(s) < chunksize:
1346 1346 raise error.Abort(
1347 1347 _(
1348 1348 b'stream ended unexpectedly '
1349 1349 b' (got %d bytes, expected %d)'
1350 1350 )
1351 1351 % (len(s), chunksize)
1352 1352 )
1353 1353
1354 1354 yield s
1355 1355 elif chunksize == flaginterrupt:
1356 1356 # Interrupt "signal" detected. The regular stream is interrupted
1357 1357 # and a bundle2 part follows. Consume it.
1358 1358 interrupthandler(ui, fh)()
1359 1359 else:
1360 1360 raise error.BundleValueError(
1361 1361 b'negative payload chunk size: %s' % chunksize
1362 1362 )
1363 1363
1364 1364 s = read(headersize)
1365 1365 if len(s) < headersize:
1366 1366 raise error.Abort(
1367 1367 _(b'stream ended unexpectedly (got %d bytes, expected %d)')
1368 1368 % (len(s), chunksize)
1369 1369 )
1370 1370
1371 1371 chunksize = unpack(s)[0]
1372 1372
1373 1373 # indebug() inlined for performance.
1374 1374 if dolog:
1375 1375 debug(b'bundle2-input: payload chunk size: %i\n' % chunksize)
1376 1376
1377 1377
1378 1378 class unbundlepart(unpackermixin):
1379 1379 """a bundle part read from a bundle"""
1380 1380
1381 1381 def __init__(self, ui, header, fp):
1382 1382 super(unbundlepart, self).__init__(fp)
1383 1383 self._seekable = util.safehasattr(fp, 'seek') and util.safehasattr(
1384 fp, b'tell'
1384 fp, 'tell'
1385 1385 )
1386 1386 self.ui = ui
1387 1387 # unbundle state attr
1388 1388 self._headerdata = header
1389 1389 self._headeroffset = 0
1390 1390 self._initialized = False
1391 1391 self.consumed = False
1392 1392 # part data
1393 1393 self.id = None
1394 1394 self.type = None
1395 1395 self.mandatoryparams = None
1396 1396 self.advisoryparams = None
1397 1397 self.params = None
1398 1398 self.mandatorykeys = ()
1399 1399 self._readheader()
1400 1400 self._mandatory = None
1401 1401 self._pos = 0
1402 1402
1403 1403 def _fromheader(self, size):
1404 1404 """return the next <size> byte from the header"""
1405 1405 offset = self._headeroffset
1406 1406 data = self._headerdata[offset : (offset + size)]
1407 1407 self._headeroffset = offset + size
1408 1408 return data
1409 1409
1410 1410 def _unpackheader(self, format):
1411 1411 """read given format from header
1412 1412
1413 1413 This automatically compute the size of the format to read."""
1414 1414 data = self._fromheader(struct.calcsize(format))
1415 1415 return _unpack(format, data)
1416 1416
1417 1417 def _initparams(self, mandatoryparams, advisoryparams):
1418 1418 """internal function to setup all logic related parameters"""
1419 1419 # make it read only to prevent people touching it by mistake.
1420 1420 self.mandatoryparams = tuple(mandatoryparams)
1421 1421 self.advisoryparams = tuple(advisoryparams)
1422 1422 # user friendly UI
1423 1423 self.params = util.sortdict(self.mandatoryparams)
1424 1424 self.params.update(self.advisoryparams)
1425 1425 self.mandatorykeys = frozenset(p[0] for p in mandatoryparams)
1426 1426
1427 1427 def _readheader(self):
1428 1428 """read the header and setup the object"""
1429 1429 typesize = self._unpackheader(_fparttypesize)[0]
1430 1430 self.type = self._fromheader(typesize)
1431 1431 indebug(self.ui, b'part type: "%s"' % self.type)
1432 1432 self.id = self._unpackheader(_fpartid)[0]
1433 1433 indebug(self.ui, b'part id: "%s"' % pycompat.bytestr(self.id))
1434 1434 # extract mandatory bit from type
1435 1435 self.mandatory = self.type != self.type.lower()
1436 1436 self.type = self.type.lower()
1437 1437 ## reading parameters
1438 1438 # param count
1439 1439 mancount, advcount = self._unpackheader(_fpartparamcount)
1440 1440 indebug(self.ui, b'part parameters: %i' % (mancount + advcount))
1441 1441 # param size
1442 1442 fparamsizes = _makefpartparamsizes(mancount + advcount)
1443 1443 paramsizes = self._unpackheader(fparamsizes)
1444 1444 # make it a list of couple again
1445 1445 paramsizes = list(zip(paramsizes[::2], paramsizes[1::2]))
1446 1446 # split mandatory from advisory
1447 1447 mansizes = paramsizes[:mancount]
1448 1448 advsizes = paramsizes[mancount:]
1449 1449 # retrieve param value
1450 1450 manparams = []
1451 1451 for key, value in mansizes:
1452 1452 manparams.append((self._fromheader(key), self._fromheader(value)))
1453 1453 advparams = []
1454 1454 for key, value in advsizes:
1455 1455 advparams.append((self._fromheader(key), self._fromheader(value)))
1456 1456 self._initparams(manparams, advparams)
1457 1457 ## part payload
1458 1458 self._payloadstream = util.chunkbuffer(self._payloadchunks())
1459 1459 # we read the data, tell it
1460 1460 self._initialized = True
1461 1461
1462 1462 def _payloadchunks(self):
1463 1463 """Generator of decoded chunks in the payload."""
1464 1464 return decodepayloadchunks(self.ui, self._fp)
1465 1465
1466 1466 def consume(self):
1467 1467 """Read the part payload until completion.
1468 1468
1469 1469 By consuming the part data, the underlying stream read offset will
1470 1470 be advanced to the next part (or end of stream).
1471 1471 """
1472 1472 if self.consumed:
1473 1473 return
1474 1474
1475 1475 chunk = self.read(32768)
1476 1476 while chunk:
1477 1477 self._pos += len(chunk)
1478 1478 chunk = self.read(32768)
1479 1479
1480 1480 def read(self, size=None):
1481 1481 """read payload data"""
1482 1482 if not self._initialized:
1483 1483 self._readheader()
1484 1484 if size is None:
1485 1485 data = self._payloadstream.read()
1486 1486 else:
1487 1487 data = self._payloadstream.read(size)
1488 1488 self._pos += len(data)
1489 1489 if size is None or len(data) < size:
1490 1490 if not self.consumed and self._pos:
1491 1491 self.ui.debug(
1492 1492 b'bundle2-input-part: total payload size %i\n' % self._pos
1493 1493 )
1494 1494 self.consumed = True
1495 1495 return data
1496 1496
1497 1497
1498 1498 class seekableunbundlepart(unbundlepart):
1499 1499 """A bundle2 part in a bundle that is seekable.
1500 1500
1501 1501 Regular ``unbundlepart`` instances can only be read once. This class
1502 1502 extends ``unbundlepart`` to enable bi-directional seeking within the
1503 1503 part.
1504 1504
1505 1505 Bundle2 part data consists of framed chunks. Offsets when seeking
1506 1506 refer to the decoded data, not the offsets in the underlying bundle2
1507 1507 stream.
1508 1508
1509 1509 To facilitate quickly seeking within the decoded data, instances of this
1510 1510 class maintain a mapping between offsets in the underlying stream and
1511 1511 the decoded payload. This mapping will consume memory in proportion
1512 1512 to the number of chunks within the payload (which almost certainly
1513 1513 increases in proportion with the size of the part).
1514 1514 """
1515 1515
1516 1516 def __init__(self, ui, header, fp):
1517 1517 # (payload, file) offsets for chunk starts.
1518 1518 self._chunkindex = []
1519 1519
1520 1520 super(seekableunbundlepart, self).__init__(ui, header, fp)
1521 1521
1522 1522 def _payloadchunks(self, chunknum=0):
1523 1523 '''seek to specified chunk and start yielding data'''
1524 1524 if len(self._chunkindex) == 0:
1525 1525 assert chunknum == 0, b'Must start with chunk 0'
1526 1526 self._chunkindex.append((0, self._tellfp()))
1527 1527 else:
1528 1528 assert chunknum < len(self._chunkindex), (
1529 1529 b'Unknown chunk %d' % chunknum
1530 1530 )
1531 1531 self._seekfp(self._chunkindex[chunknum][1])
1532 1532
1533 1533 pos = self._chunkindex[chunknum][0]
1534 1534
1535 1535 for chunk in decodepayloadchunks(self.ui, self._fp):
1536 1536 chunknum += 1
1537 1537 pos += len(chunk)
1538 1538 if chunknum == len(self._chunkindex):
1539 1539 self._chunkindex.append((pos, self._tellfp()))
1540 1540
1541 1541 yield chunk
1542 1542
1543 1543 def _findchunk(self, pos):
1544 1544 '''for a given payload position, return a chunk number and offset'''
1545 1545 for chunk, (ppos, fpos) in enumerate(self._chunkindex):
1546 1546 if ppos == pos:
1547 1547 return chunk, 0
1548 1548 elif ppos > pos:
1549 1549 return chunk - 1, pos - self._chunkindex[chunk - 1][0]
1550 1550 raise ValueError(b'Unknown chunk')
1551 1551
1552 1552 def tell(self):
1553 1553 return self._pos
1554 1554
1555 1555 def seek(self, offset, whence=os.SEEK_SET):
1556 1556 if whence == os.SEEK_SET:
1557 1557 newpos = offset
1558 1558 elif whence == os.SEEK_CUR:
1559 1559 newpos = self._pos + offset
1560 1560 elif whence == os.SEEK_END:
1561 1561 if not self.consumed:
1562 1562 # Can't use self.consume() here because it advances self._pos.
1563 1563 chunk = self.read(32768)
1564 1564 while chunk:
1565 1565 chunk = self.read(32768)
1566 1566 newpos = self._chunkindex[-1][0] - offset
1567 1567 else:
1568 1568 raise ValueError(b'Unknown whence value: %r' % (whence,))
1569 1569
1570 1570 if newpos > self._chunkindex[-1][0] and not self.consumed:
1571 1571 # Can't use self.consume() here because it advances self._pos.
1572 1572 chunk = self.read(32768)
1573 1573 while chunk:
1574 1574 chunk = self.read(32668)
1575 1575
1576 1576 if not 0 <= newpos <= self._chunkindex[-1][0]:
1577 1577 raise ValueError(b'Offset out of range')
1578 1578
1579 1579 if self._pos != newpos:
1580 1580 chunk, internaloffset = self._findchunk(newpos)
1581 1581 self._payloadstream = util.chunkbuffer(self._payloadchunks(chunk))
1582 1582 adjust = self.read(internaloffset)
1583 1583 if len(adjust) != internaloffset:
1584 1584 raise error.Abort(_(b'Seek failed\n'))
1585 1585 self._pos = newpos
1586 1586
1587 1587 def _seekfp(self, offset, whence=0):
1588 1588 """move the underlying file pointer
1589 1589
1590 1590 This method is meant for internal usage by the bundle2 protocol only.
1591 1591 They directly manipulate the low level stream including bundle2 level
1592 1592 instruction.
1593 1593
1594 1594 Do not use it to implement higher-level logic or methods."""
1595 1595 if self._seekable:
1596 1596 return self._fp.seek(offset, whence)
1597 1597 else:
1598 1598 raise NotImplementedError(_(b'File pointer is not seekable'))
1599 1599
1600 1600 def _tellfp(self):
1601 1601 """return the file offset, or None if file is not seekable
1602 1602
1603 1603 This method is meant for internal usage by the bundle2 protocol only.
1604 1604 They directly manipulate the low level stream including bundle2 level
1605 1605 instruction.
1606 1606
1607 1607 Do not use it to implement higher-level logic or methods."""
1608 1608 if self._seekable:
1609 1609 try:
1610 1610 return self._fp.tell()
1611 1611 except IOError as e:
1612 1612 if e.errno == errno.ESPIPE:
1613 1613 self._seekable = False
1614 1614 else:
1615 1615 raise
1616 1616 return None
1617 1617
1618 1618
1619 1619 # These are only the static capabilities.
1620 1620 # Check the 'getrepocaps' function for the rest.
1621 1621 capabilities = {
1622 1622 b'HG20': (),
1623 1623 b'bookmarks': (),
1624 1624 b'error': (b'abort', b'unsupportedcontent', b'pushraced', b'pushkey'),
1625 1625 b'listkeys': (),
1626 1626 b'pushkey': (),
1627 1627 b'digests': tuple(sorted(util.DIGESTS.keys())),
1628 1628 b'remote-changegroup': (b'http', b'https'),
1629 1629 b'hgtagsfnodes': (),
1630 1630 b'phases': (b'heads',),
1631 1631 b'stream': (b'v2',),
1632 1632 }
1633 1633
1634 1634
1635 1635 def getrepocaps(repo, allowpushback=False, role=None):
1636 1636 """return the bundle2 capabilities for a given repo
1637 1637
1638 1638 Exists to allow extensions (like evolution) to mutate the capabilities.
1639 1639
1640 1640 The returned value is used for servers advertising their capabilities as
1641 1641 well as clients advertising their capabilities to servers as part of
1642 1642 bundle2 requests. The ``role`` argument specifies which is which.
1643 1643 """
1644 1644 if role not in (b'client', b'server'):
1645 1645 raise error.ProgrammingError(b'role argument must be client or server')
1646 1646
1647 1647 caps = capabilities.copy()
1648 1648 caps[b'changegroup'] = tuple(
1649 1649 sorted(changegroup.supportedincomingversions(repo))
1650 1650 )
1651 1651 if obsolete.isenabled(repo, obsolete.exchangeopt):
1652 1652 supportedformat = tuple(b'V%i' % v for v in obsolete.formats)
1653 1653 caps[b'obsmarkers'] = supportedformat
1654 1654 if allowpushback:
1655 1655 caps[b'pushback'] = ()
1656 1656 cpmode = repo.ui.config(b'server', b'concurrent-push-mode')
1657 1657 if cpmode == b'check-related':
1658 1658 caps[b'checkheads'] = (b'related',)
1659 1659 if b'phases' in repo.ui.configlist(b'devel', b'legacy.exchange'):
1660 1660 caps.pop(b'phases')
1661 1661
1662 1662 # Don't advertise stream clone support in server mode if not configured.
1663 1663 if role == b'server':
1664 1664 streamsupported = repo.ui.configbool(
1665 1665 b'server', b'uncompressed', untrusted=True
1666 1666 )
1667 1667 featuresupported = repo.ui.configbool(b'server', b'bundle2.stream')
1668 1668
1669 1669 if not streamsupported or not featuresupported:
1670 1670 caps.pop(b'stream')
1671 1671 # Else always advertise support on client, because payload support
1672 1672 # should always be advertised.
1673 1673
1674 1674 if repo.ui.configbool(b'experimental', b'stream-v3'):
1675 1675 if b'stream' in caps:
1676 1676 caps[b'stream'] += (b'v3-exp',)
1677 1677
1678 1678 # b'rev-branch-cache is no longer advertised, but still supported
1679 1679 # for legacy clients.
1680 1680
1681 1681 return caps
1682 1682
1683 1683
1684 1684 def bundle2caps(remote):
1685 1685 """return the bundle capabilities of a peer as dict"""
1686 1686 raw = remote.capable(b'bundle2')
1687 1687 if not raw and raw != b'':
1688 1688 return {}
1689 1689 capsblob = urlreq.unquote(remote.capable(b'bundle2'))
1690 1690 return decodecaps(capsblob)
1691 1691
1692 1692
1693 1693 def obsmarkersversion(caps):
1694 1694 """extract the list of supported obsmarkers versions from a bundle2caps dict"""
1695 1695 obscaps = caps.get(b'obsmarkers', ())
1696 1696 return [int(c[1:]) for c in obscaps if c.startswith(b'V')]
1697 1697
1698 1698
1699 1699 def writenewbundle(
1700 1700 ui,
1701 1701 repo,
1702 1702 source,
1703 1703 filename,
1704 1704 bundletype,
1705 1705 outgoing,
1706 1706 opts,
1707 1707 vfs=None,
1708 1708 compression=None,
1709 1709 compopts=None,
1710 1710 allow_internal=False,
1711 1711 ):
1712 1712 if bundletype.startswith(b'HG10'):
1713 1713 cg = changegroup.makechangegroup(repo, outgoing, b'01', source)
1714 1714 return writebundle(
1715 1715 ui,
1716 1716 cg,
1717 1717 filename,
1718 1718 bundletype,
1719 1719 vfs=vfs,
1720 1720 compression=compression,
1721 1721 compopts=compopts,
1722 1722 )
1723 1723 elif not bundletype.startswith(b'HG20'):
1724 1724 raise error.ProgrammingError(b'unknown bundle type: %s' % bundletype)
1725 1725
1726 1726 # enforce that no internal phase are to be bundled
1727 1727 bundled_internal = repo.revs(b"%ln and _internal()", outgoing.ancestorsof)
1728 1728 if bundled_internal and not allow_internal:
1729 1729 count = len(repo.revs(b'%ln and _internal()', outgoing.missing))
1730 1730 msg = "backup bundle would contains %d internal changesets"
1731 1731 msg %= count
1732 1732 raise error.ProgrammingError(msg)
1733 1733
1734 1734 caps = {}
1735 1735 if opts.get(b'obsolescence', False):
1736 1736 caps[b'obsmarkers'] = (b'V1',)
1737 1737 if opts.get(b'streamv2'):
1738 1738 caps[b'stream'] = [b'v2']
1739 1739 elif opts.get(b'streamv3-exp'):
1740 1740 caps[b'stream'] = [b'v3-exp']
1741 1741 bundle = bundle20(ui, caps)
1742 1742 bundle.setcompression(compression, compopts)
1743 1743 _addpartsfromopts(ui, repo, bundle, source, outgoing, opts)
1744 1744 chunkiter = bundle.getchunks()
1745 1745
1746 1746 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
1747 1747
1748 1748
1749 1749 def _addpartsfromopts(ui, repo, bundler, source, outgoing, opts):
1750 1750 # We should eventually reconcile this logic with the one behind
1751 1751 # 'exchange.getbundle2partsgenerator'.
1752 1752 #
1753 1753 # The type of input from 'getbundle' and 'writenewbundle' are a bit
1754 1754 # different right now. So we keep them separated for now for the sake of
1755 1755 # simplicity.
1756 1756
1757 1757 # we might not always want a changegroup in such bundle, for example in
1758 1758 # stream bundles
1759 1759 if opts.get(b'changegroup', True):
1760 1760 cgversion = opts.get(b'cg.version')
1761 1761 if cgversion is None:
1762 1762 cgversion = changegroup.safeversion(repo)
1763 1763 cg = changegroup.makechangegroup(repo, outgoing, cgversion, source)
1764 1764 part = bundler.newpart(b'changegroup', data=cg.getchunks())
1765 1765 part.addparam(b'version', cg.version)
1766 1766 if b'clcount' in cg.extras:
1767 1767 part.addparam(
1768 1768 b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
1769 1769 )
1770 1770 if opts.get(b'phases'):
1771 1771 target_phase = phases.draft
1772 1772 for head in outgoing.ancestorsof:
1773 1773 target_phase = max(target_phase, repo[head].phase())
1774 1774 if target_phase > phases.draft:
1775 1775 part.addparam(
1776 1776 b'targetphase',
1777 1777 b'%d' % target_phase,
1778 1778 mandatory=False,
1779 1779 )
1780 1780 if repository.REPO_FEATURE_SIDE_DATA in repo.features:
1781 1781 part.addparam(b'exp-sidedata', b'1')
1782 1782
1783 1783 if opts.get(b'streamv2', False):
1784 1784 addpartbundlestream2(bundler, repo, stream=True)
1785 1785
1786 1786 if opts.get(b'streamv3-exp', False):
1787 1787 addpartbundlestream2(bundler, repo, stream=True)
1788 1788
1789 1789 if opts.get(b'tagsfnodescache', True):
1790 1790 addparttagsfnodescache(repo, bundler, outgoing)
1791 1791
1792 1792 if opts.get(b'revbranchcache', True):
1793 1793 addpartrevbranchcache(repo, bundler, outgoing)
1794 1794
1795 1795 if opts.get(b'obsolescence', False):
1796 1796 obsmarkers = repo.obsstore.relevantmarkers(outgoing.missing)
1797 1797 buildobsmarkerspart(
1798 1798 bundler,
1799 1799 obsmarkers,
1800 1800 mandatory=opts.get(b'obsolescence-mandatory', True),
1801 1801 )
1802 1802
1803 1803 if opts.get(b'phases', False):
1804 1804 headsbyphase = phases.subsetphaseheads(repo, outgoing.missing)
1805 1805 phasedata = phases.binaryencode(headsbyphase)
1806 1806 bundler.newpart(b'phase-heads', data=phasedata)
1807 1807
1808 1808
1809 1809 def addparttagsfnodescache(repo, bundler, outgoing):
1810 1810 # we include the tags fnode cache for the bundle changeset
1811 1811 # (as an optional parts)
1812 1812 cache = tags.hgtagsfnodescache(repo.unfiltered())
1813 1813 chunks = []
1814 1814
1815 1815 # .hgtags fnodes are only relevant for head changesets. While we could
1816 1816 # transfer values for all known nodes, there will likely be little to
1817 1817 # no benefit.
1818 1818 #
1819 1819 # We don't bother using a generator to produce output data because
1820 1820 # a) we only have 40 bytes per head and even esoteric numbers of heads
1821 1821 # consume little memory (1M heads is 40MB) b) we don't want to send the
1822 1822 # part if we don't have entries and knowing if we have entries requires
1823 1823 # cache lookups.
1824 1824 for node in outgoing.ancestorsof:
1825 1825 # Don't compute missing, as this may slow down serving.
1826 1826 fnode = cache.getfnode(node, computemissing=False)
1827 1827 if fnode:
1828 1828 chunks.extend([node, fnode])
1829 1829
1830 1830 if chunks:
1831 1831 bundler.newpart(b'hgtagsfnodes', data=b''.join(chunks))
1832 1832
1833 1833
1834 1834 def addpartrevbranchcache(repo, bundler, outgoing):
1835 1835 # we include the rev branch cache for the bundle changeset
1836 1836 # (as an optional parts)
1837 1837 cache = repo.revbranchcache()
1838 1838 cl = repo.unfiltered().changelog
1839 1839 branchesdata = collections.defaultdict(lambda: (set(), set()))
1840 1840 for node in outgoing.missing:
1841 1841 branch, close = cache.branchinfo(cl.rev(node))
1842 1842 branchesdata[branch][close].add(node)
1843 1843
1844 1844 def generate():
1845 1845 for branch, (nodes, closed) in sorted(branchesdata.items()):
1846 1846 utf8branch = encoding.fromlocal(branch)
1847 1847 yield rbcstruct.pack(len(utf8branch), len(nodes), len(closed))
1848 1848 yield utf8branch
1849 1849 for n in sorted(nodes):
1850 1850 yield n
1851 1851 for n in sorted(closed):
1852 1852 yield n
1853 1853
1854 1854 bundler.newpart(b'cache:rev-branch-cache', data=generate(), mandatory=False)
1855 1855
1856 1856
1857 1857 def _formatrequirementsspec(requirements):
1858 1858 requirements = [req for req in requirements if req != b"shared"]
1859 1859 return urlreq.quote(b','.join(sorted(requirements)))
1860 1860
1861 1861
1862 1862 def _formatrequirementsparams(requirements):
1863 1863 requirements = _formatrequirementsspec(requirements)
1864 1864 params = b"%s%s" % (urlreq.quote(b"requirements="), requirements)
1865 1865 return params
1866 1866
1867 1867
1868 1868 def format_remote_wanted_sidedata(repo):
1869 1869 """Formats a repo's wanted sidedata categories into a bytestring for
1870 1870 capabilities exchange."""
1871 1871 wanted = b""
1872 1872 if repo._wanted_sidedata:
1873 1873 wanted = b','.join(
1874 1874 pycompat.bytestr(c) for c in sorted(repo._wanted_sidedata)
1875 1875 )
1876 1876 return wanted
1877 1877
1878 1878
1879 1879 def read_remote_wanted_sidedata(remote):
1880 1880 sidedata_categories = remote.capable(b'exp-wanted-sidedata')
1881 1881 return read_wanted_sidedata(sidedata_categories)
1882 1882
1883 1883
1884 1884 def read_wanted_sidedata(formatted):
1885 1885 if formatted:
1886 1886 return set(formatted.split(b','))
1887 1887 return set()
1888 1888
1889 1889
1890 1890 def addpartbundlestream2(bundler, repo, **kwargs):
1891 1891 if not kwargs.get('stream', False):
1892 1892 return
1893 1893
1894 1894 if not streamclone.allowservergeneration(repo):
1895 1895 msg = _(b'stream data requested but server does not allow this feature')
1896 1896 hint = _(b'the client seems buggy')
1897 1897 raise error.Abort(msg, hint=hint)
1898 1898 if not (b'stream' in bundler.capabilities):
1899 1899 msg = _(
1900 1900 b'stream data requested but supported streaming clone versions were not specified'
1901 1901 )
1902 1902 hint = _(b'the client seems buggy')
1903 1903 raise error.Abort(msg, hint=hint)
1904 1904 client_supported = set(bundler.capabilities[b'stream'])
1905 1905 server_supported = set(getrepocaps(repo, role=b'client').get(b'stream', []))
1906 1906 common_supported = client_supported & server_supported
1907 1907 if not common_supported:
1908 1908 msg = _(b'no common supported version with the client: %s; %s')
1909 1909 str_server = b','.join(sorted(server_supported))
1910 1910 str_client = b','.join(sorted(client_supported))
1911 1911 msg %= (str_server, str_client)
1912 1912 raise error.Abort(msg)
1913 1913 version = max(common_supported)
1914 1914
1915 1915 # Stream clones don't compress well. And compression undermines a
1916 1916 # goal of stream clones, which is to be fast. Communicate the desire
1917 1917 # to avoid compression to consumers of the bundle.
1918 1918 bundler.prefercompressed = False
1919 1919
1920 1920 # get the includes and excludes
1921 1921 includepats = kwargs.get('includepats')
1922 1922 excludepats = kwargs.get('excludepats')
1923 1923
1924 1924 narrowstream = repo.ui.configbool(
1925 1925 b'experimental', b'server.stream-narrow-clones'
1926 1926 )
1927 1927
1928 1928 if (includepats or excludepats) and not narrowstream:
1929 1929 raise error.Abort(_(b'server does not support narrow stream clones'))
1930 1930
1931 1931 includeobsmarkers = False
1932 1932 if repo.obsstore:
1933 1933 remoteversions = obsmarkersversion(bundler.capabilities)
1934 1934 if not remoteversions:
1935 1935 raise error.Abort(
1936 1936 _(
1937 1937 b'server has obsolescence markers, but client '
1938 1938 b'cannot receive them via stream clone'
1939 1939 )
1940 1940 )
1941 1941 elif repo.obsstore._version in remoteversions:
1942 1942 includeobsmarkers = True
1943 1943
1944 1944 if version == b"v2":
1945 1945 filecount, bytecount, it = streamclone.generatev2(
1946 1946 repo, includepats, excludepats, includeobsmarkers
1947 1947 )
1948 1948 requirements = streamclone.streamed_requirements(repo)
1949 1949 requirements = _formatrequirementsspec(requirements)
1950 1950 part = bundler.newpart(b'stream2', data=it)
1951 1951 part.addparam(b'bytecount', b'%d' % bytecount, mandatory=True)
1952 1952 part.addparam(b'filecount', b'%d' % filecount, mandatory=True)
1953 1953 part.addparam(b'requirements', requirements, mandatory=True)
1954 1954 elif version == b"v3-exp":
1955 1955 filecount, bytecount, it = streamclone.generatev2(
1956 1956 repo, includepats, excludepats, includeobsmarkers
1957 1957 )
1958 1958 requirements = streamclone.streamed_requirements(repo)
1959 1959 requirements = _formatrequirementsspec(requirements)
1960 1960 part = bundler.newpart(b'stream3-exp', data=it)
1961 1961 part.addparam(b'bytecount', b'%d' % bytecount, mandatory=True)
1962 1962 part.addparam(b'filecount', b'%d' % filecount, mandatory=True)
1963 1963 part.addparam(b'requirements', requirements, mandatory=True)
1964 1964
1965 1965
1966 1966 def buildobsmarkerspart(bundler, markers, mandatory=True):
1967 1967 """add an obsmarker part to the bundler with <markers>
1968 1968
1969 1969 No part is created if markers is empty.
1970 1970 Raises ValueError if the bundler doesn't support any known obsmarker format.
1971 1971 """
1972 1972 if not markers:
1973 1973 return None
1974 1974
1975 1975 remoteversions = obsmarkersversion(bundler.capabilities)
1976 1976 version = obsolete.commonversion(remoteversions)
1977 1977 if version is None:
1978 1978 raise ValueError(b'bundler does not support common obsmarker format')
1979 1979 stream = obsolete.encodemarkers(markers, True, version=version)
1980 1980 return bundler.newpart(b'obsmarkers', data=stream, mandatory=mandatory)
1981 1981
1982 1982
1983 1983 def writebundle(
1984 1984 ui, cg, filename, bundletype, vfs=None, compression=None, compopts=None
1985 1985 ):
1986 1986 """Write a bundle file and return its filename.
1987 1987
1988 1988 Existing files will not be overwritten.
1989 1989 If no filename is specified, a temporary file is created.
1990 1990 bz2 compression can be turned off.
1991 1991 The bundle file will be deleted in case of errors.
1992 1992 """
1993 1993
1994 1994 if bundletype == b"HG20":
1995 1995 bundle = bundle20(ui)
1996 1996 bundle.setcompression(compression, compopts)
1997 1997 part = bundle.newpart(b'changegroup', data=cg.getchunks())
1998 1998 part.addparam(b'version', cg.version)
1999 1999 if b'clcount' in cg.extras:
2000 2000 part.addparam(
2001 2001 b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
2002 2002 )
2003 2003 chunkiter = bundle.getchunks()
2004 2004 else:
2005 2005 # compression argument is only for the bundle2 case
2006 2006 assert compression is None
2007 2007 if cg.version != b'01':
2008 2008 raise error.Abort(
2009 2009 _(b'old bundle types only supports v1 changegroups')
2010 2010 )
2011 2011
2012 2012 # HG20 is the case without 2 values to unpack, but is handled above.
2013 2013 # pytype: disable=bad-unpacking
2014 2014 header, comp = bundletypes[bundletype]
2015 2015 # pytype: enable=bad-unpacking
2016 2016
2017 2017 if comp not in util.compengines.supportedbundletypes:
2018 2018 raise error.Abort(_(b'unknown stream compression type: %s') % comp)
2019 2019 compengine = util.compengines.forbundletype(comp)
2020 2020
2021 2021 def chunkiter():
2022 2022 yield header
2023 2023 for chunk in compengine.compressstream(cg.getchunks(), compopts):
2024 2024 yield chunk
2025 2025
2026 2026 chunkiter = chunkiter()
2027 2027
2028 2028 # parse the changegroup data, otherwise we will block
2029 2029 # in case of sshrepo because we don't know the end of the stream
2030 2030 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
2031 2031
2032 2032
2033 2033 def combinechangegroupresults(op):
2034 2034 """logic to combine 0 or more addchangegroup results into one"""
2035 2035 results = [r.get(b'return', 0) for r in op.records[b'changegroup']]
2036 2036 changedheads = 0
2037 2037 result = 1
2038 2038 for ret in results:
2039 2039 # If any changegroup result is 0, return 0
2040 2040 if ret == 0:
2041 2041 result = 0
2042 2042 break
2043 2043 if ret < -1:
2044 2044 changedheads += ret + 1
2045 2045 elif ret > 1:
2046 2046 changedheads += ret - 1
2047 2047 if changedheads > 0:
2048 2048 result = 1 + changedheads
2049 2049 elif changedheads < 0:
2050 2050 result = -1 + changedheads
2051 2051 return result
2052 2052
2053 2053
2054 2054 @parthandler(
2055 2055 b'changegroup',
2056 2056 (
2057 2057 b'version',
2058 2058 b'nbchanges',
2059 2059 b'exp-sidedata',
2060 2060 b'exp-wanted-sidedata',
2061 2061 b'treemanifest',
2062 2062 b'targetphase',
2063 2063 ),
2064 2064 )
2065 2065 def handlechangegroup(op, inpart):
2066 2066 """apply a changegroup part on the repo"""
2067 2067 from . import localrepo
2068 2068
2069 2069 tr = op.gettransaction()
2070 2070 unpackerversion = inpart.params.get(b'version', b'01')
2071 2071 # We should raise an appropriate exception here
2072 2072 cg = changegroup.getunbundler(unpackerversion, inpart, None)
2073 2073 # the source and url passed here are overwritten by the one contained in
2074 2074 # the transaction.hookargs argument. So 'bundle2' is a placeholder
2075 2075 nbchangesets = None
2076 2076 if b'nbchanges' in inpart.params:
2077 2077 nbchangesets = int(inpart.params.get(b'nbchanges'))
2078 2078 if b'treemanifest' in inpart.params and not scmutil.istreemanifest(op.repo):
2079 2079 if len(op.repo.changelog) != 0:
2080 2080 raise error.Abort(
2081 2081 _(
2082 2082 b"bundle contains tree manifests, but local repo is "
2083 2083 b"non-empty and does not use tree manifests"
2084 2084 )
2085 2085 )
2086 2086 op.repo.requirements.add(requirements.TREEMANIFEST_REQUIREMENT)
2087 2087 op.repo.svfs.options = localrepo.resolvestorevfsoptions(
2088 2088 op.repo.ui, op.repo.requirements, op.repo.features
2089 2089 )
2090 2090 scmutil.writereporequirements(op.repo)
2091 2091
2092 2092 extrakwargs = {}
2093 2093 targetphase = inpart.params.get(b'targetphase')
2094 2094 if targetphase is not None:
2095 2095 extrakwargs['targetphase'] = int(targetphase)
2096 2096
2097 2097 remote_sidedata = inpart.params.get(b'exp-wanted-sidedata')
2098 2098 extrakwargs['sidedata_categories'] = read_wanted_sidedata(remote_sidedata)
2099 2099
2100 2100 ret = _processchangegroup(
2101 2101 op,
2102 2102 cg,
2103 2103 tr,
2104 2104 op.source,
2105 2105 b'bundle2',
2106 2106 expectedtotal=nbchangesets,
2107 2107 **extrakwargs
2108 2108 )
2109 2109 if op.reply is not None:
2110 2110 # This is definitely not the final form of this
2111 2111 # return. But one need to start somewhere.
2112 2112 part = op.reply.newpart(b'reply:changegroup', mandatory=False)
2113 2113 part.addparam(
2114 2114 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2115 2115 )
2116 2116 part.addparam(b'return', b'%i' % ret, mandatory=False)
2117 2117 assert not inpart.read()
2118 2118
2119 2119
2120 2120 _remotechangegroupparams = tuple(
2121 2121 [b'url', b'size', b'digests']
2122 2122 + [b'digest:%s' % k for k in util.DIGESTS.keys()]
2123 2123 )
2124 2124
2125 2125
2126 2126 @parthandler(b'remote-changegroup', _remotechangegroupparams)
2127 2127 def handleremotechangegroup(op, inpart):
2128 2128 """apply a bundle10 on the repo, given an url and validation information
2129 2129
2130 2130 All the information about the remote bundle to import are given as
2131 2131 parameters. The parameters include:
2132 2132 - url: the url to the bundle10.
2133 2133 - size: the bundle10 file size. It is used to validate what was
2134 2134 retrieved by the client matches the server knowledge about the bundle.
2135 2135 - digests: a space separated list of the digest types provided as
2136 2136 parameters.
2137 2137 - digest:<digest-type>: the hexadecimal representation of the digest with
2138 2138 that name. Like the size, it is used to validate what was retrieved by
2139 2139 the client matches what the server knows about the bundle.
2140 2140
2141 2141 When multiple digest types are given, all of them are checked.
2142 2142 """
2143 2143 try:
2144 2144 raw_url = inpart.params[b'url']
2145 2145 except KeyError:
2146 2146 raise error.Abort(_(b'remote-changegroup: missing "%s" param') % b'url')
2147 2147 parsed_url = urlutil.url(raw_url)
2148 2148 if parsed_url.scheme not in capabilities[b'remote-changegroup']:
2149 2149 raise error.Abort(
2150 2150 _(b'remote-changegroup does not support %s urls')
2151 2151 % parsed_url.scheme
2152 2152 )
2153 2153
2154 2154 try:
2155 2155 size = int(inpart.params[b'size'])
2156 2156 except ValueError:
2157 2157 raise error.Abort(
2158 2158 _(b'remote-changegroup: invalid value for param "%s"') % b'size'
2159 2159 )
2160 2160 except KeyError:
2161 2161 raise error.Abort(
2162 2162 _(b'remote-changegroup: missing "%s" param') % b'size'
2163 2163 )
2164 2164
2165 2165 digests = {}
2166 2166 for typ in inpart.params.get(b'digests', b'').split():
2167 2167 param = b'digest:%s' % typ
2168 2168 try:
2169 2169 value = inpart.params[param]
2170 2170 except KeyError:
2171 2171 raise error.Abort(
2172 2172 _(b'remote-changegroup: missing "%s" param') % param
2173 2173 )
2174 2174 digests[typ] = value
2175 2175
2176 2176 real_part = util.digestchecker(url.open(op.ui, raw_url), size, digests)
2177 2177
2178 2178 tr = op.gettransaction()
2179 2179 from . import exchange
2180 2180
2181 2181 cg = exchange.readbundle(op.repo.ui, real_part, raw_url)
2182 2182 if not isinstance(cg, changegroup.cg1unpacker):
2183 2183 raise error.Abort(
2184 2184 _(b'%s: not a bundle version 1.0') % urlutil.hidepassword(raw_url)
2185 2185 )
2186 2186 ret = _processchangegroup(op, cg, tr, op.source, b'bundle2')
2187 2187 if op.reply is not None:
2188 2188 # This is definitely not the final form of this
2189 2189 # return. But one need to start somewhere.
2190 2190 part = op.reply.newpart(b'reply:changegroup')
2191 2191 part.addparam(
2192 2192 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2193 2193 )
2194 2194 part.addparam(b'return', b'%i' % ret, mandatory=False)
2195 2195 try:
2196 2196 real_part.validate()
2197 2197 except error.Abort as e:
2198 2198 raise error.Abort(
2199 2199 _(b'bundle at %s is corrupted:\n%s')
2200 2200 % (urlutil.hidepassword(raw_url), e.message)
2201 2201 )
2202 2202 assert not inpart.read()
2203 2203
2204 2204
2205 2205 @parthandler(b'reply:changegroup', (b'return', b'in-reply-to'))
2206 2206 def handlereplychangegroup(op, inpart):
2207 2207 ret = int(inpart.params[b'return'])
2208 2208 replyto = int(inpart.params[b'in-reply-to'])
2209 2209 op.records.add(b'changegroup', {b'return': ret}, replyto)
2210 2210
2211 2211
2212 2212 @parthandler(b'check:bookmarks')
2213 2213 def handlecheckbookmarks(op, inpart):
2214 2214 """check location of bookmarks
2215 2215
2216 2216 This part is to be used to detect push race regarding bookmark, it
2217 2217 contains binary encoded (bookmark, node) tuple. If the local state does
2218 2218 not marks the one in the part, a PushRaced exception is raised
2219 2219 """
2220 2220 bookdata = bookmarks.binarydecode(op.repo, inpart)
2221 2221
2222 2222 msgstandard = (
2223 2223 b'remote repository changed while pushing - please try again '
2224 2224 b'(bookmark "%s" move from %s to %s)'
2225 2225 )
2226 2226 msgmissing = (
2227 2227 b'remote repository changed while pushing - please try again '
2228 2228 b'(bookmark "%s" is missing, expected %s)'
2229 2229 )
2230 2230 msgexist = (
2231 2231 b'remote repository changed while pushing - please try again '
2232 2232 b'(bookmark "%s" set on %s, expected missing)'
2233 2233 )
2234 2234 for book, node in bookdata:
2235 2235 currentnode = op.repo._bookmarks.get(book)
2236 2236 if currentnode != node:
2237 2237 if node is None:
2238 2238 finalmsg = msgexist % (book, short(currentnode))
2239 2239 elif currentnode is None:
2240 2240 finalmsg = msgmissing % (book, short(node))
2241 2241 else:
2242 2242 finalmsg = msgstandard % (
2243 2243 book,
2244 2244 short(node),
2245 2245 short(currentnode),
2246 2246 )
2247 2247 raise error.PushRaced(finalmsg)
2248 2248
2249 2249
2250 2250 @parthandler(b'check:heads')
2251 2251 def handlecheckheads(op, inpart):
2252 2252 """check that head of the repo did not change
2253 2253
2254 2254 This is used to detect a push race when using unbundle.
2255 2255 This replaces the "heads" argument of unbundle."""
2256 2256 h = inpart.read(20)
2257 2257 heads = []
2258 2258 while len(h) == 20:
2259 2259 heads.append(h)
2260 2260 h = inpart.read(20)
2261 2261 assert not h
2262 2262 # Trigger a transaction so that we are guaranteed to have the lock now.
2263 2263 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2264 2264 op.gettransaction()
2265 2265 if sorted(heads) != sorted(op.repo.heads()):
2266 2266 raise error.PushRaced(
2267 2267 b'remote repository changed while pushing - please try again'
2268 2268 )
2269 2269
2270 2270
2271 2271 @parthandler(b'check:updated-heads')
2272 2272 def handlecheckupdatedheads(op, inpart):
2273 2273 """check for race on the heads touched by a push
2274 2274
2275 2275 This is similar to 'check:heads' but focus on the heads actually updated
2276 2276 during the push. If other activities happen on unrelated heads, it is
2277 2277 ignored.
2278 2278
2279 2279 This allow server with high traffic to avoid push contention as long as
2280 2280 unrelated parts of the graph are involved."""
2281 2281 h = inpart.read(20)
2282 2282 heads = []
2283 2283 while len(h) == 20:
2284 2284 heads.append(h)
2285 2285 h = inpart.read(20)
2286 2286 assert not h
2287 2287 # trigger a transaction so that we are guaranteed to have the lock now.
2288 2288 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2289 2289 op.gettransaction()
2290 2290
2291 2291 currentheads = set()
2292 2292 for ls in op.repo.branchmap().iterheads():
2293 2293 currentheads.update(ls)
2294 2294
2295 2295 for h in heads:
2296 2296 if h not in currentheads:
2297 2297 raise error.PushRaced(
2298 2298 b'remote repository changed while pushing - '
2299 2299 b'please try again'
2300 2300 )
2301 2301
2302 2302
2303 2303 @parthandler(b'check:phases')
2304 2304 def handlecheckphases(op, inpart):
2305 2305 """check that phase boundaries of the repository did not change
2306 2306
2307 2307 This is used to detect a push race.
2308 2308 """
2309 2309 phasetonodes = phases.binarydecode(inpart)
2310 2310 unfi = op.repo.unfiltered()
2311 2311 cl = unfi.changelog
2312 2312 phasecache = unfi._phasecache
2313 2313 msg = (
2314 2314 b'remote repository changed while pushing - please try again '
2315 2315 b'(%s is %s expected %s)'
2316 2316 )
2317 2317 for expectedphase, nodes in phasetonodes.items():
2318 2318 for n in nodes:
2319 2319 actualphase = phasecache.phase(unfi, cl.rev(n))
2320 2320 if actualphase != expectedphase:
2321 2321 finalmsg = msg % (
2322 2322 short(n),
2323 2323 phases.phasenames[actualphase],
2324 2324 phases.phasenames[expectedphase],
2325 2325 )
2326 2326 raise error.PushRaced(finalmsg)
2327 2327
2328 2328
2329 2329 @parthandler(b'output')
2330 2330 def handleoutput(op, inpart):
2331 2331 """forward output captured on the server to the client"""
2332 2332 for line in inpart.read().splitlines():
2333 2333 op.ui.status(_(b'remote: %s\n') % line)
2334 2334
2335 2335
2336 2336 @parthandler(b'replycaps')
2337 2337 def handlereplycaps(op, inpart):
2338 2338 """Notify that a reply bundle should be created
2339 2339
2340 2340 The payload contains the capabilities information for the reply"""
2341 2341 caps = decodecaps(inpart.read())
2342 2342 if op.reply is None:
2343 2343 op.reply = bundle20(op.ui, caps)
2344 2344
2345 2345
2346 2346 class AbortFromPart(error.Abort):
2347 2347 """Sub-class of Abort that denotes an error from a bundle2 part."""
2348 2348
2349 2349
2350 2350 @parthandler(b'error:abort', (b'message', b'hint'))
2351 2351 def handleerrorabort(op, inpart):
2352 2352 """Used to transmit abort error over the wire"""
2353 2353 raise AbortFromPart(
2354 2354 inpart.params[b'message'], hint=inpart.params.get(b'hint')
2355 2355 )
2356 2356
2357 2357
2358 2358 @parthandler(
2359 2359 b'error:pushkey',
2360 2360 (b'namespace', b'key', b'new', b'old', b'ret', b'in-reply-to'),
2361 2361 )
2362 2362 def handleerrorpushkey(op, inpart):
2363 2363 """Used to transmit failure of a mandatory pushkey over the wire"""
2364 2364 kwargs = {}
2365 2365 for name in (b'namespace', b'key', b'new', b'old', b'ret'):
2366 2366 value = inpart.params.get(name)
2367 2367 if value is not None:
2368 2368 kwargs[name] = value
2369 2369 raise error.PushkeyFailed(
2370 2370 inpart.params[b'in-reply-to'], **pycompat.strkwargs(kwargs)
2371 2371 )
2372 2372
2373 2373
2374 2374 @parthandler(b'error:unsupportedcontent', (b'parttype', b'params'))
2375 2375 def handleerrorunsupportedcontent(op, inpart):
2376 2376 """Used to transmit unknown content error over the wire"""
2377 2377 kwargs = {}
2378 2378 parttype = inpart.params.get(b'parttype')
2379 2379 if parttype is not None:
2380 2380 kwargs[b'parttype'] = parttype
2381 2381 params = inpart.params.get(b'params')
2382 2382 if params is not None:
2383 2383 kwargs[b'params'] = params.split(b'\0')
2384 2384
2385 2385 raise error.BundleUnknownFeatureError(**pycompat.strkwargs(kwargs))
2386 2386
2387 2387
2388 2388 @parthandler(b'error:pushraced', (b'message',))
2389 2389 def handleerrorpushraced(op, inpart):
2390 2390 """Used to transmit push race error over the wire"""
2391 2391 raise error.ResponseError(_(b'push failed:'), inpart.params[b'message'])
2392 2392
2393 2393
2394 2394 @parthandler(b'listkeys', (b'namespace',))
2395 2395 def handlelistkeys(op, inpart):
2396 2396 """retrieve pushkey namespace content stored in a bundle2"""
2397 2397 namespace = inpart.params[b'namespace']
2398 2398 r = pushkey.decodekeys(inpart.read())
2399 2399 op.records.add(b'listkeys', (namespace, r))
2400 2400
2401 2401
2402 2402 @parthandler(b'pushkey', (b'namespace', b'key', b'old', b'new'))
2403 2403 def handlepushkey(op, inpart):
2404 2404 """process a pushkey request"""
2405 2405 dec = pushkey.decode
2406 2406 namespace = dec(inpart.params[b'namespace'])
2407 2407 key = dec(inpart.params[b'key'])
2408 2408 old = dec(inpart.params[b'old'])
2409 2409 new = dec(inpart.params[b'new'])
2410 2410 # Grab the transaction to ensure that we have the lock before performing the
2411 2411 # pushkey.
2412 2412 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2413 2413 op.gettransaction()
2414 2414 ret = op.repo.pushkey(namespace, key, old, new)
2415 2415 record = {b'namespace': namespace, b'key': key, b'old': old, b'new': new}
2416 2416 op.records.add(b'pushkey', record)
2417 2417 if op.reply is not None:
2418 2418 rpart = op.reply.newpart(b'reply:pushkey')
2419 2419 rpart.addparam(
2420 2420 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2421 2421 )
2422 2422 rpart.addparam(b'return', b'%i' % ret, mandatory=False)
2423 2423 if inpart.mandatory and not ret:
2424 2424 kwargs = {}
2425 2425 for key in (b'namespace', b'key', b'new', b'old', b'ret'):
2426 2426 if key in inpart.params:
2427 2427 kwargs[key] = inpart.params[key]
2428 2428 raise error.PushkeyFailed(
2429 2429 partid=b'%d' % inpart.id, **pycompat.strkwargs(kwargs)
2430 2430 )
2431 2431
2432 2432
2433 2433 @parthandler(b'bookmarks')
2434 2434 def handlebookmark(op, inpart):
2435 2435 """transmit bookmark information
2436 2436
2437 2437 The part contains binary encoded bookmark information.
2438 2438
2439 2439 The exact behavior of this part can be controlled by the 'bookmarks' mode
2440 2440 on the bundle operation.
2441 2441
2442 2442 When mode is 'apply' (the default) the bookmark information is applied as
2443 2443 is to the unbundling repository. Make sure a 'check:bookmarks' part is
2444 2444 issued earlier to check for push races in such update. This behavior is
2445 2445 suitable for pushing.
2446 2446
2447 2447 When mode is 'records', the information is recorded into the 'bookmarks'
2448 2448 records of the bundle operation. This behavior is suitable for pulling.
2449 2449 """
2450 2450 changes = bookmarks.binarydecode(op.repo, inpart)
2451 2451
2452 2452 pushkeycompat = op.repo.ui.configbool(
2453 2453 b'server', b'bookmarks-pushkey-compat'
2454 2454 )
2455 2455 bookmarksmode = op.modes.get(b'bookmarks', b'apply')
2456 2456
2457 2457 if bookmarksmode == b'apply':
2458 2458 tr = op.gettransaction()
2459 2459 bookstore = op.repo._bookmarks
2460 2460 if pushkeycompat:
2461 2461 allhooks = []
2462 2462 for book, node in changes:
2463 2463 hookargs = tr.hookargs.copy()
2464 2464 hookargs[b'pushkeycompat'] = b'1'
2465 2465 hookargs[b'namespace'] = b'bookmarks'
2466 2466 hookargs[b'key'] = book
2467 2467 hookargs[b'old'] = hex(bookstore.get(book, b''))
2468 2468 hookargs[b'new'] = hex(node if node is not None else b'')
2469 2469 allhooks.append(hookargs)
2470 2470
2471 2471 for hookargs in allhooks:
2472 2472 op.repo.hook(
2473 2473 b'prepushkey', throw=True, **pycompat.strkwargs(hookargs)
2474 2474 )
2475 2475
2476 2476 for book, node in changes:
2477 2477 if bookmarks.isdivergent(book):
2478 2478 msg = _(b'cannot accept divergent bookmark %s!') % book
2479 2479 raise error.Abort(msg)
2480 2480
2481 2481 bookstore.applychanges(op.repo, op.gettransaction(), changes)
2482 2482
2483 2483 if pushkeycompat:
2484 2484
2485 2485 def runhook(unused_success):
2486 2486 for hookargs in allhooks:
2487 2487 op.repo.hook(b'pushkey', **pycompat.strkwargs(hookargs))
2488 2488
2489 2489 op.repo._afterlock(runhook)
2490 2490
2491 2491 elif bookmarksmode == b'records':
2492 2492 for book, node in changes:
2493 2493 record = {b'bookmark': book, b'node': node}
2494 2494 op.records.add(b'bookmarks', record)
2495 2495 else:
2496 2496 raise error.ProgrammingError(
2497 2497 b'unknown bookmark mode: %s' % bookmarksmode
2498 2498 )
2499 2499
2500 2500
2501 2501 @parthandler(b'phase-heads')
2502 2502 def handlephases(op, inpart):
2503 2503 """apply phases from bundle part to repo"""
2504 2504 headsbyphase = phases.binarydecode(inpart)
2505 2505 phases.updatephases(op.repo.unfiltered(), op.gettransaction, headsbyphase)
2506 2506
2507 2507
2508 2508 @parthandler(b'reply:pushkey', (b'return', b'in-reply-to'))
2509 2509 def handlepushkeyreply(op, inpart):
2510 2510 """retrieve the result of a pushkey request"""
2511 2511 ret = int(inpart.params[b'return'])
2512 2512 partid = int(inpart.params[b'in-reply-to'])
2513 2513 op.records.add(b'pushkey', {b'return': ret}, partid)
2514 2514
2515 2515
2516 2516 @parthandler(b'obsmarkers')
2517 2517 def handleobsmarker(op, inpart):
2518 2518 """add a stream of obsmarkers to the repo"""
2519 2519 tr = op.gettransaction()
2520 2520 markerdata = inpart.read()
2521 2521 if op.ui.config(b'experimental', b'obsmarkers-exchange-debug'):
2522 2522 op.ui.writenoi18n(
2523 2523 b'obsmarker-exchange: %i bytes received\n' % len(markerdata)
2524 2524 )
2525 2525 # The mergemarkers call will crash if marker creation is not enabled.
2526 2526 # we want to avoid this if the part is advisory.
2527 2527 if not inpart.mandatory and op.repo.obsstore.readonly:
2528 2528 op.repo.ui.debug(
2529 2529 b'ignoring obsolescence markers, feature not enabled\n'
2530 2530 )
2531 2531 return
2532 2532 new = op.repo.obsstore.mergemarkers(tr, markerdata)
2533 2533 op.repo.invalidatevolatilesets()
2534 2534 op.records.add(b'obsmarkers', {b'new': new})
2535 2535 if op.reply is not None:
2536 2536 rpart = op.reply.newpart(b'reply:obsmarkers')
2537 2537 rpart.addparam(
2538 2538 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2539 2539 )
2540 2540 rpart.addparam(b'new', b'%i' % new, mandatory=False)
2541 2541
2542 2542
2543 2543 @parthandler(b'reply:obsmarkers', (b'new', b'in-reply-to'))
2544 2544 def handleobsmarkerreply(op, inpart):
2545 2545 """retrieve the result of a pushkey request"""
2546 2546 ret = int(inpart.params[b'new'])
2547 2547 partid = int(inpart.params[b'in-reply-to'])
2548 2548 op.records.add(b'obsmarkers', {b'new': ret}, partid)
2549 2549
2550 2550
2551 2551 @parthandler(b'hgtagsfnodes')
2552 2552 def handlehgtagsfnodes(op, inpart):
2553 2553 """Applies .hgtags fnodes cache entries to the local repo.
2554 2554
2555 2555 Payload is pairs of 20 byte changeset nodes and filenodes.
2556 2556 """
2557 2557 # Grab the transaction so we ensure that we have the lock at this point.
2558 2558 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2559 2559 op.gettransaction()
2560 2560 cache = tags.hgtagsfnodescache(op.repo.unfiltered())
2561 2561
2562 2562 count = 0
2563 2563 while True:
2564 2564 node = inpart.read(20)
2565 2565 fnode = inpart.read(20)
2566 2566 if len(node) < 20 or len(fnode) < 20:
2567 2567 op.ui.debug(b'ignoring incomplete received .hgtags fnodes data\n')
2568 2568 break
2569 2569 cache.setfnode(node, fnode)
2570 2570 count += 1
2571 2571
2572 2572 cache.write()
2573 2573 op.ui.debug(b'applied %i hgtags fnodes cache entries\n' % count)
2574 2574
2575 2575
2576 2576 rbcstruct = struct.Struct(b'>III')
2577 2577
2578 2578
2579 2579 @parthandler(b'cache:rev-branch-cache')
2580 2580 def handlerbc(op, inpart):
2581 2581 """Legacy part, ignored for compatibility with bundles from or
2582 2582 for Mercurial before 5.7. Newer Mercurial computes the cache
2583 2583 efficiently enough during unbundling that the additional transfer
2584 2584 is unnecessary."""
2585 2585
2586 2586
2587 2587 @parthandler(b'pushvars')
2588 2588 def bundle2getvars(op, part):
2589 2589 '''unbundle a bundle2 containing shellvars on the server'''
2590 2590 # An option to disable unbundling on server-side for security reasons
2591 2591 if op.ui.configbool(b'push', b'pushvars.server'):
2592 2592 hookargs = {}
2593 2593 for key, value in part.advisoryparams:
2594 2594 key = key.upper()
2595 2595 # We want pushed variables to have USERVAR_ prepended so we know
2596 2596 # they came from the --pushvar flag.
2597 2597 key = b"USERVAR_" + key
2598 2598 hookargs[key] = value
2599 2599 op.addhookargs(hookargs)
2600 2600
2601 2601
2602 2602 @parthandler(b'stream2', (b'requirements', b'filecount', b'bytecount'))
2603 2603 def handlestreamv2bundle(op, part):
2604 2604
2605 2605 requirements = urlreq.unquote(part.params[b'requirements'])
2606 2606 requirements = requirements.split(b',') if requirements else []
2607 2607 filecount = int(part.params[b'filecount'])
2608 2608 bytecount = int(part.params[b'bytecount'])
2609 2609
2610 2610 repo = op.repo
2611 2611 if len(repo):
2612 2612 msg = _(b'cannot apply stream clone to non empty repository')
2613 2613 raise error.Abort(msg)
2614 2614
2615 2615 repo.ui.debug(b'applying stream bundle\n')
2616 2616 streamclone.applybundlev2(repo, part, filecount, bytecount, requirements)
2617 2617
2618 2618
2619 2619 @parthandler(b'stream3-exp', (b'requirements', b'filecount', b'bytecount'))
2620 2620 def handlestreamv3bundle(op, part):
2621 2621 return handlestreamv2bundle(op, part)
2622 2622
2623 2623
2624 2624 def widen_bundle(
2625 2625 bundler, repo, oldmatcher, newmatcher, common, known, cgversion, ellipses
2626 2626 ):
2627 2627 """generates bundle2 for widening a narrow clone
2628 2628
2629 2629 bundler is the bundle to which data should be added
2630 2630 repo is the localrepository instance
2631 2631 oldmatcher matches what the client already has
2632 2632 newmatcher matches what the client needs (including what it already has)
2633 2633 common is set of common heads between server and client
2634 2634 known is a set of revs known on the client side (used in ellipses)
2635 2635 cgversion is the changegroup version to send
2636 2636 ellipses is boolean value telling whether to send ellipses data or not
2637 2637
2638 2638 returns bundle2 of the data required for extending
2639 2639 """
2640 2640 commonnodes = set()
2641 2641 cl = repo.changelog
2642 2642 for r in repo.revs(b"::%ln", common):
2643 2643 commonnodes.add(cl.node(r))
2644 2644 if commonnodes:
2645 2645 packer = changegroup.getbundler(
2646 2646 cgversion,
2647 2647 repo,
2648 2648 oldmatcher=oldmatcher,
2649 2649 matcher=newmatcher,
2650 2650 fullnodes=commonnodes,
2651 2651 )
2652 2652 cgdata = packer.generate(
2653 2653 {repo.nullid},
2654 2654 list(commonnodes),
2655 2655 False,
2656 2656 b'narrow_widen',
2657 2657 changelog=False,
2658 2658 )
2659 2659
2660 2660 part = bundler.newpart(b'changegroup', data=cgdata)
2661 2661 part.addparam(b'version', cgversion)
2662 2662 if scmutil.istreemanifest(repo):
2663 2663 part.addparam(b'treemanifest', b'1')
2664 2664 if repository.REPO_FEATURE_SIDE_DATA in repo.features:
2665 2665 part.addparam(b'exp-sidedata', b'1')
2666 2666 wanted = format_remote_wanted_sidedata(repo)
2667 2667 part.addparam(b'exp-wanted-sidedata', wanted)
2668 2668
2669 2669 return bundler
General Comments 0
You need to be logged in to leave comments. Login now