bundle2: use sysstr to check for attribute presence...
marmoute -
r51786:0e936b95 default
@@ -1,2676 +1,2676 b''
1 1 # bundle2.py - generic container format to transmit arbitrary data.
2 2 #
3 3 # Copyright 2013 Facebook, Inc.
4 4 #
5 5 # This software may be used and distributed according to the terms of the
6 6 # GNU General Public License version 2 or any later version.
7 7 """Handling of the new bundle2 format
8 8
9 9 The goal of bundle2 is to act as an atomic packet to transmit a set of
10 10 payloads in an application agnostic way. It consists of a sequence of "parts"
11 11 that will be handed to and processed by the application layer.
12 12
13 13
14 14 General format architecture
15 15 ===========================
16 16
17 17 The format is structured as follows:
18 18
19 19 - magic string
20 20 - stream level parameters
21 21 - payload parts (any number)
22 22 - end of stream marker.
23 23
24 24 The binary format
25 25 ============================
26 26
27 27 All numbers are unsigned and big-endian.
28 28
29 29 stream level parameters
30 30 ------------------------
31 31
32 32 The binary format is as follows:
33 33
34 34 :params size: int32
35 35
36 36 The total number of Bytes used by the parameters
37 37
38 38 :params value: arbitrary number of Bytes
39 39
40 40 A blob of `params size` containing the serialized version of all stream level
41 41 parameters.
42 42
43 43 The blob contains a space separated list of parameters. Parameters with value
44 44 are stored in the form `<name>=<value>`. Both name and value are urlquoted.
45 45
46 46 Empty names are forbidden.
47 47
48 48 A name MUST start with a letter. If this first letter is lower case, the
49 49 parameter is advisory and can be safely ignored. However, when the first
50 50 letter is capital, the parameter is mandatory and the bundling process MUST
51 51 stop if it is not able to process it.
52 52
53 53 Stream parameters use a simple textual format for two main reasons:
54 54
55 55 - Stream level parameters should remain simple and we want to discourage any
56 56 crazy usage.
57 57 - Textual data allows easy human inspection of a bundle2 header in case of
58 58 trouble.
59 59
60 60 Any application-level options MUST go into a bundle2 part instead.
61 61
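The stream level parameter layout described above can be sketched in a few lines of standalone Python. This is only an illustration under stated assumptions, not the module's own code: the helper names `encode_stream_params`, `decode_stream_params` and `is_mandatory` are hypothetical, the standard library `urllib.parse` functions stand in for Mercurial's url-quoting wrappers, and the size field uses the `>i` struct format that the module defines for it.

```python
import struct
from urllib.parse import quote_from_bytes, unquote_to_bytes


def encode_stream_params(params):
    # Serialize (name, value) pairs as `<size><blob>`; value may be None.
    blocks = []
    for name, value in params:
        if not name[:1].isalpha():
            raise ValueError("parameter name must start with a letter")
        item = quote_from_bytes(name).encode("ascii")
        if value is not None:
            item += b"=" + quote_from_bytes(value).encode("ascii")
        blocks.append(item)
    blob = b" ".join(blocks)
    # The blob is prefixed with its size as a big-endian int32.
    return struct.pack(">i", len(blob)) + blob


def decode_stream_params(data):
    # Inverse of encode_stream_params; returns a dict of bytes -> bytes/None.
    (size,) = struct.unpack(">i", data[:4])
    blob = data[4:4 + size]
    params = {}
    if not blob:
        return params
    for item in blob.split(b" "):
        name, sep, value = item.partition(b"=")
        params[unquote_to_bytes(name)] = unquote_to_bytes(value) if sep else None
    return params


def is_mandatory(name):
    # A capital first letter marks the parameter as mandatory.
    return name[:1].isupper()
```

Url-quoting both sides means that spaces and `=` inside a name or value cannot be confused with the separators of the format.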
62 62 Payload part
63 63 ------------------------
64 64
65 65 The binary format is as follows:
66 66
67 67 :header size: int32
68 68
69 69 The total number of Bytes used by the part header. When the header is empty
70 70 (size = 0) this is interpreted as the end of stream marker.
71 71
72 72 :header:
73 73
74 74 The header defines how to interpret the part. It contains two pieces of
75 75 data: the part type, and the part parameters.
76 76
77 77 The part type is used to route to an application-level handler that can
78 78 interpret the payload.
79 79
80 80 Part parameters are passed to the application level handler. They are
81 81 meant to convey information that will help the application level object to
82 82 interpret the part payload.
83 83
84 84 The binary format of the header is as follows:
85 85
86 86 :typesize: (one byte)
87 87
88 88 :parttype: alphanumerical part name (restricted to [a-zA-Z0-9_:-]*)
89 89
90 90 :partid: A 32-bit integer (unique in the bundle) that can be used to refer
91 91 to this part.
92 92
93 93 :parameters:
94 94
95 95 Part parameters may have arbitrary content; the binary structure is::
96 96
97 97 <mandatory-count><advisory-count><param-sizes><param-data>
98 98
99 99 :mandatory-count: 1 byte, number of mandatory parameters
100 100
101 101 :advisory-count: 1 byte, number of advisory parameters
102 102
103 103 :param-sizes:
104 104
105 105 N pairs of bytes, where N is the total number of parameters. Each
106 106 pair contains (<size-of-key>, <size-of-value>) for one parameter.
107 107
108 108 :param-data:
109 109
110 110 A blob of bytes from which each parameter key and value can be
111 111 retrieved using the list of size pairs stored in the previous
112 112 field.
113 113
114 114 Mandatory parameters come first, then the advisory ones.
115 115
116 116 Each parameter's key MUST be unique within the part.
117 117
118 118 :payload:
119 119
120 120 payload is a series of `<chunksize><chunkdata>`.
121 121
122 122 `chunksize` is an int32, `chunkdata` are plain bytes (as many as
123 123 `chunksize` says). The payload part is concluded by a zero size chunk.
124 124
125 125 The current implementation always produces either zero or one chunk.
126 126 This is an implementation limitation that will ultimately be lifted.
127 127
128 128 `chunksize` can be negative to trigger special case processing. No such
129 129 processing is in place yet.
130 130
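The `<chunksize><chunkdata>` framing of a part payload can be illustrated with a small standalone sketch. The function names `frame_payload` and `read_payload` are hypothetical and exist only for this example. The reader below simply rejects negative sizes, which is a simplification: the format reserves them for special-case processing.

```python
import io
import struct


def frame_payload(data, chunksize=32768):
    # Yield `<chunksize><chunkdata>` frames, then the zero size terminator.
    for start in range(0, len(data), chunksize):
        chunk = data[start:start + chunksize]
        yield struct.pack(">i", len(chunk)) + chunk
    yield struct.pack(">i", 0)


def read_payload(fp):
    # Read frames from a file-like object until the zero size chunk.
    chunks = []
    while True:
        (size,) = struct.unpack(">i", fp.read(4))
        if size == 0:
            break
        if size < 0:
            # Simplification: special-case (negative) sizes are not handled.
            raise ValueError("negative chunk size: %d" % size)
        chunks.append(fp.read(size))
    return b"".join(chunks)


stream = b"".join(frame_payload(b"hello world", chunksize=4))
payload = read_payload(io.BytesIO(stream))
```

Because the terminator is itself a chunk of size zero, a reader never needs to know the total payload length in advance.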
131 131 Bundle processing
132 132 ============================
133 133
134 134 Each part is processed in order using a "part handler". Handlers are registered
135 135 for a certain part type.
136 136
137 137 The matching of a part to its handler is case insensitive. The case of the
138 138 part type is used to know if a part is mandatory or advisory. If the part type
139 139 contains any uppercase character it is considered mandatory. When no handler is
140 140 known for a mandatory part, the process is aborted and an exception is raised.
141 141 If the part is advisory and no handler is known, the part is ignored. When the
142 142 process is aborted, the full bundle is still read from the stream to keep the
143 143 channel usable. However, none of the parts read after an abort are processed. In
144 144 the future, dropping the stream may become an option for channels we do not care
145 145 to preserve.
146 146 """
147 147
148 148
149 149 import collections
150 150 import errno
151 151 import os
152 152 import re
153 153 import string
154 154 import struct
155 155 import sys
156 156
157 157 from .i18n import _
158 158 from .node import (
159 159 hex,
160 160 short,
161 161 )
162 162 from . import (
163 163 bookmarks,
164 164 changegroup,
165 165 encoding,
166 166 error,
167 167 obsolete,
168 168 phases,
169 169 pushkey,
170 170 pycompat,
171 171 requirements,
172 172 scmutil,
173 173 streamclone,
174 174 tags,
175 175 url,
176 176 util,
177 177 )
178 178 from .utils import (
179 179 stringutil,
180 180 urlutil,
181 181 )
182 182 from .interfaces import repository
183 183
184 184 urlerr = util.urlerr
185 185 urlreq = util.urlreq
186 186
187 187 _pack = struct.pack
188 188 _unpack = struct.unpack
189 189
190 190 _fstreamparamsize = b'>i'
191 191 _fpartheadersize = b'>i'
192 192 _fparttypesize = b'>B'
193 193 _fpartid = b'>I'
194 194 _fpayloadsize = b'>i'
195 195 _fpartparamcount = b'>BB'
196 196
197 197 preferedchunksize = 32768
198 198
199 199 _parttypeforbidden = re.compile(b'[^a-zA-Z0-9_:-]')
200 200
201 201
202 202 def outdebug(ui, message):
203 203 """debug regarding output stream (bundling)"""
204 204 if ui.configbool(b'devel', b'bundle2.debug'):
205 205 ui.debug(b'bundle2-output: %s\n' % message)
206 206
207 207
208 208 def indebug(ui, message):
209 209 """debug on input stream (unbundling)"""
210 210 if ui.configbool(b'devel', b'bundle2.debug'):
211 211 ui.debug(b'bundle2-input: %s\n' % message)
212 212
213 213
214 214 def validateparttype(parttype):
215 215 """raise ValueError if a parttype contains an invalid character"""
216 216 if _parttypeforbidden.search(parttype):
217 217 raise ValueError(parttype)
218 218
219 219
220 220 def _makefpartparamsizes(nbparams):
221 221 """return a struct format to read part parameter sizes
222 222
223 223 The number of parameters is variable so we need to build that format
224 224 dynamically.
225 225 """
226 226 return b'>' + (b'BB' * nbparams)
227 227
228 228
229 229 parthandlermapping = {}
230 230
231 231
232 232 def parthandler(parttype, params=()):
233 233 """decorator that registers a function as a bundle2 part handler
234 234
235 235 eg::
236 236
237 237 @parthandler('myparttype', ('mandatory', 'param', 'handled'))
238 238 def myparttypehandler(...):
239 239 '''process a part of type "my part".'''
240 240 ...
241 241 """
242 242 validateparttype(parttype)
243 243
244 244 def _decorator(func):
245 245 lparttype = parttype.lower() # enforce lower case matching.
246 246 assert lparttype not in parthandlermapping
247 247 parthandlermapping[lparttype] = func
248 248 func.params = frozenset(params)
249 249 return func
250 250
251 251 return _decorator
252 252
253 253
254 254 class unbundlerecords:
255 255 """keep record of what happens during an unbundle
256 256
257 257 New records are added using `records.add('cat', obj)`, where 'cat' is a
258 258 category of record and obj is an arbitrary object.
259 259
260 260 `records['cat']` will return all entries of this category 'cat'.
261 261
262 262 Iterating on the object itself will yield `('category', obj)` tuples
263 263 for all entries.
264 264
265 265 All iterations happen in chronological order.
266 266 """
267 267
268 268 def __init__(self):
269 269 self._categories = {}
270 270 self._sequences = []
271 271 self._replies = {}
272 272
273 273 def add(self, category, entry, inreplyto=None):
274 274 """add a new record of a given category.
275 275
276 276 The entry can then be retrieved in the list returned by
277 277 self['category']."""
278 278 self._categories.setdefault(category, []).append(entry)
279 279 self._sequences.append((category, entry))
280 280 if inreplyto is not None:
281 281 self.getreplies(inreplyto).add(category, entry)
282 282
283 283 def getreplies(self, partid):
284 284 """get the records that are replies to a specific part"""
285 285 return self._replies.setdefault(partid, unbundlerecords())
286 286
287 287 def __getitem__(self, cat):
288 288 return tuple(self._categories.get(cat, ()))
289 289
290 290 def __iter__(self):
291 291 return iter(self._sequences)
292 292
293 293 def __len__(self):
294 294 return len(self._sequences)
295 295
296 296 def __nonzero__(self):
297 297 return bool(self._sequences)
298 298
299 299 __bool__ = __nonzero__
300 300
301 301
302 302 class bundleoperation:
303 303 """an object that represents a single bundling process
304 304
305 305 Its purpose is to carry unbundle-related objects and states.
306 306
307 307 A new object should be created at the beginning of each bundle processing.
308 308 The object is to be returned by the processing function.
309 309
310 310 The object has very little content now; it will ultimately contain:
311 311 * an access to the repo the bundle is applied to,
312 312 * a ui object,
313 313 * a way to retrieve a transaction to add changes to the repo,
314 314 * a way to record the result of processing each part,
315 315 * a way to construct a bundle response when applicable.
316 316 """
317 317
318 318 def __init__(
319 319 self,
320 320 repo,
321 321 transactiongetter,
322 322 captureoutput=True,
323 323 source=b'',
324 324 remote=None,
325 325 ):
326 326 self.repo = repo
327 327 # the peer object who produced this bundle if available
328 328 self.remote = remote
329 329 self.ui = repo.ui
330 330 self.records = unbundlerecords()
331 331 self.reply = None
332 332 self.captureoutput = captureoutput
333 333 self.hookargs = {}
334 334 self._gettransaction = transactiongetter
335 335 # carries value that can modify part behavior
336 336 self.modes = {}
337 337 self.source = source
338 338
339 339 def gettransaction(self):
340 340 transaction = self._gettransaction()
341 341
342 342 if self.hookargs:
343 343 # the ones added to the transaction supersede those added
344 344 # to the operation.
345 345 self.hookargs.update(transaction.hookargs)
346 346 transaction.hookargs = self.hookargs
347 347
348 348 # mark the hookargs as flushed. further attempts to add to
349 349 # hookargs will result in an abort.
350 350 self.hookargs = None
351 351
352 352 return transaction
353 353
354 354 def addhookargs(self, hookargs):
355 355 if self.hookargs is None:
356 356 raise error.ProgrammingError(
357 357 b'attempted to add hookargs to '
358 358 b'operation after transaction started'
359 359 )
360 360 self.hookargs.update(hookargs)
361 361
362 362
363 363 class TransactionUnavailable(RuntimeError):
364 364 pass
365 365
366 366
367 367 def _notransaction():
368 368 """default method to get a transaction while processing a bundle
369 369
370 370 Raise an exception to highlight the fact that no transaction was expected
371 371 to be created"""
372 372 raise TransactionUnavailable()
373 373
374 374
375 375 def applybundle(repo, unbundler, tr, source, url=None, remote=None, **kwargs):
376 376 # transform me into unbundler.apply() as soon as the freeze is lifted
377 377 if isinstance(unbundler, unbundle20):
378 378 tr.hookargs[b'bundle2'] = b'1'
379 379 if source is not None and b'source' not in tr.hookargs:
380 380 tr.hookargs[b'source'] = source
381 381 if url is not None and b'url' not in tr.hookargs:
382 382 tr.hookargs[b'url'] = url
383 383 return processbundle(
384 384 repo, unbundler, lambda: tr, source=source, remote=remote
385 385 )
386 386 else:
387 387 # the transactiongetter won't be used, but we might as well set it
388 388 op = bundleoperation(repo, lambda: tr, source=source, remote=remote)
389 389 _processchangegroup(op, unbundler, tr, source, url, **kwargs)
390 390 return op
391 391
392 392
393 393 class partiterator:
394 394 def __init__(self, repo, op, unbundler):
395 395 self.repo = repo
396 396 self.op = op
397 397 self.unbundler = unbundler
398 398 self.iterator = None
399 399 self.count = 0
400 400 self.current = None
401 401
402 402 def __enter__(self):
403 403 def func():
404 404 itr = enumerate(self.unbundler.iterparts(), 1)
405 405 for count, p in itr:
406 406 self.count = count
407 407 self.current = p
408 408 yield p
409 409 p.consume()
410 410 self.current = None
411 411
412 412 self.iterator = func()
413 413 return self.iterator
414 414
415 415 def __exit__(self, type, exc, tb):
416 416 if not self.iterator:
417 417 return
418 418
419 419 # Only gracefully abort in a normal exception situation. User aborts
420 420 # like Ctrl+C throw a KeyboardInterrupt which is not a base Exception,
421 421 # and should not gracefully cleanup.
422 422 if isinstance(exc, Exception):
423 423 # Any exceptions seeking to the end of the bundle at this point are
424 424 # almost certainly related to the underlying stream being bad.
425 425 # And, chances are that the exception we're handling is related to
426 426 # getting in that bad state. So, we swallow the seeking error and
427 427 # re-raise the original error.
428 428 seekerror = False
429 429 try:
430 430 if self.current:
431 431 # consume the part content to not corrupt the stream.
432 432 self.current.consume()
433 433
434 434 for part in self.iterator:
435 435 # consume the bundle content
436 436 part.consume()
437 437 except Exception:
438 438 seekerror = True
439 439
440 440 # Small hack to let caller code distinguish exceptions from bundle2
441 441 # processing from processing the old format. This is mostly needed
442 442 # to handle different return codes to unbundle according to the type
443 443 # of bundle. We should probably clean up or drop this return code
444 444 # craziness in a future version.
445 445 exc.duringunbundle2 = True
446 446 salvaged = []
447 447 replycaps = None
448 448 if self.op.reply is not None:
449 449 salvaged = self.op.reply.salvageoutput()
450 450 replycaps = self.op.reply.capabilities
451 451 exc._replycaps = replycaps
452 452 exc._bundle2salvagedoutput = salvaged
453 453
454 454 # Re-raising from a variable loses the original stack. So only use
455 455 # that form if we need to.
456 456 if seekerror:
457 457 raise exc
458 458
459 459 self.repo.ui.debug(
460 460 b'bundle2-input-bundle: %i parts total\n' % self.count
461 461 )
462 462
463 463
464 464 def processbundle(
465 465 repo,
466 466 unbundler,
467 467 transactiongetter=None,
468 468 op=None,
469 469 source=b'',
470 470 remote=None,
471 471 ):
472 472 """Process a bundle, applying its effects to a repo
473 473
474 474 It iterates over each part then searches for and uses the proper handling
475 475 code to process the part. Parts are processed in order.
476 476
477 477 An unknown mandatory part will abort the process.
478 478
479 479 It is temporarily possible to provide a prebuilt bundleoperation to the
480 480 function. This is used to ensure output is properly propagated in case of
481 481 an error during the unbundling. This output capturing part will likely be
482 482 reworked and this ability will probably go away in the process.
483 483 """
484 484 if op is None:
485 485 if transactiongetter is None:
486 486 transactiongetter = _notransaction
487 487 op = bundleoperation(
488 488 repo,
489 489 transactiongetter,
490 490 source=source,
491 491 remote=remote,
492 492 )
493 493 # todo:
494 494 # - replace this with an init function soon.
495 495 # - exception catching
496 496 unbundler.params
497 497 if repo.ui.debugflag:
498 498 msg = [b'bundle2-input-bundle:']
499 499 if unbundler.params:
500 500 msg.append(b' %i params' % len(unbundler.params))
501 501 if op._gettransaction is None or op._gettransaction is _notransaction:
502 502 msg.append(b' no-transaction')
503 503 else:
504 504 msg.append(b' with-transaction')
505 505 msg.append(b'\n')
506 506 repo.ui.debug(b''.join(msg))
507 507
508 508 processparts(repo, op, unbundler)
509 509
510 510 return op
511 511
512 512
513 513 def processparts(repo, op, unbundler):
514 514 with partiterator(repo, op, unbundler) as parts:
515 515 for part in parts:
516 516 _processpart(op, part)
517 517
518 518
519 519 def _processchangegroup(op, cg, tr, source, url, **kwargs):
520 520 if op.remote is not None and op.remote.path is not None:
521 521 remote_path = op.remote.path
522 522 kwargs = kwargs.copy()
523 523 kwargs['delta_base_reuse_policy'] = remote_path.delta_reuse_policy
524 524 ret = cg.apply(op.repo, tr, source, url, **kwargs)
525 525 op.records.add(
526 526 b'changegroup',
527 527 {
528 528 b'return': ret,
529 529 },
530 530 )
531 531 return ret
532 532
533 533
534 534 def _gethandler(op, part):
535 535 status = b'unknown' # used by debug output
536 536 try:
537 537 handler = parthandlermapping.get(part.type)
538 538 if handler is None:
539 539 status = b'unsupported-type'
540 540 raise error.BundleUnknownFeatureError(parttype=part.type)
541 541 indebug(op.ui, b'found a handler for part %s' % part.type)
542 542 unknownparams = part.mandatorykeys - handler.params
543 543 if unknownparams:
544 544 unknownparams = list(unknownparams)
545 545 unknownparams.sort()
546 546 status = b'unsupported-params (%s)' % b', '.join(unknownparams)
547 547 raise error.BundleUnknownFeatureError(
548 548 parttype=part.type, params=unknownparams
549 549 )
550 550 status = b'supported'
551 551 except error.BundleUnknownFeatureError as exc:
552 552 if part.mandatory: # mandatory parts
553 553 raise
554 554 indebug(op.ui, b'ignoring unsupported advisory part %s' % exc)
555 555 return # skip to part processing
556 556 finally:
557 557 if op.ui.debugflag:
558 558 msg = [b'bundle2-input-part: "%s"' % part.type]
559 559 if not part.mandatory:
560 560 msg.append(b' (advisory)')
561 561 nbmp = len(part.mandatorykeys)
562 562 nbap = len(part.params) - nbmp
563 563 if nbmp or nbap:
564 564 msg.append(b' (params:')
565 565 if nbmp:
566 566 msg.append(b' %i mandatory' % nbmp)
567 567 if nbap:
568 568 msg.append(b' %i advisory' % nbap)
569 569 msg.append(b')')
570 570 msg.append(b' %s\n' % status)
571 571 op.ui.debug(b''.join(msg))
572 572
573 573 return handler
574 574
575 575
576 576 def _processpart(op, part):
577 577 """process a single part from a bundle
578 578
579 579 The part is guaranteed to have been fully consumed when the function exits
580 580 (even if an exception is raised)."""
581 581 handler = _gethandler(op, part)
582 582 if handler is None:
583 583 return
584 584
585 585 # handler is called outside the above try block so that we don't
586 586 # risk catching KeyErrors from anything other than the
587 587 # parthandlermapping lookup (any KeyError raised by handler()
588 588 # itself represents a defect of a different variety).
589 589 output = None
590 590 if op.captureoutput and op.reply is not None:
591 591 op.ui.pushbuffer(error=True, subproc=True)
592 592 output = b''
593 593 try:
594 594 handler(op, part)
595 595 finally:
596 596 if output is not None:
597 597 output = op.ui.popbuffer()
598 598 if output:
599 599 outpart = op.reply.newpart(b'output', data=output, mandatory=False)
600 600 outpart.addparam(
601 601 b'in-reply-to', pycompat.bytestr(part.id), mandatory=False
602 602 )
603 603
604 604
605 605 def decodecaps(blob):
606 606 """decode a bundle2 caps bytes blob into a dictionary
607 607
608 608 The blob is a list of capabilities (one per line)
609 609 Capabilities may have values using a line of the form::
610 610
611 611 capability=value1,value2,value3
612 612
613 613 The values are always a list."""
614 614 caps = {}
615 615 for line in blob.splitlines():
616 616 if not line:
617 617 continue
618 618 if b'=' not in line:
619 619 key, vals = line, ()
620 620 else:
621 621 key, vals = line.split(b'=', 1)
622 622 vals = vals.split(b',')
623 623 key = urlreq.unquote(key)
624 624 vals = [urlreq.unquote(v) for v in vals]
625 625 caps[key] = vals
626 626 return caps
627 627
628 628
629 629 def encodecaps(caps):
630 630 """encode a bundle2 caps dictionary into a bytes blob"""
631 631 chunks = []
632 632 for ca in sorted(caps):
633 633 vals = caps[ca]
634 634 ca = urlreq.quote(ca)
635 635 vals = [urlreq.quote(v) for v in vals]
636 636 if vals:
637 637 ca = b"%s=%s" % (ca, b','.join(vals))
638 638 chunks.append(ca)
639 639 return b'\n'.join(chunks)
640 640
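A round trip through the capabilities encoding can be sketched with the standard library alone. This is an approximation under stated assumptions: the function names `encode_caps` and `decode_caps` are hypothetical, and the sketch works on `str` with `urllib.parse.quote` and `unquote`, whereas the module operates on bytes through its own `urlreq` wrappers.

```python
from urllib.parse import quote, unquote


def encode_caps(caps):
    """Encode {name: [values]} as one url-quoted `name=v1,v2` line per capability."""
    lines = []
    for name in sorted(caps):
        line = quote(name)
        values = [quote(value) for value in caps[name]]
        if values:
            line = "%s=%s" % (line, ",".join(values))
        lines.append(line)
    return "\n".join(lines)


def decode_caps(blob):
    """Decode the text produced by encode_caps back into a dictionary."""
    caps = {}
    for line in blob.splitlines():
        if not line:
            continue
        name, sep, rest = line.partition("=")
        values = rest.split(",") if sep else []
        caps[unquote(name)] = [unquote(value) for value in values]
    return caps
```

Quoting each value before joining is what lets a value contain a comma or an equals sign without being split on decode.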
641 641
642 642 bundletypes = {
643 643 b"": (b"", b'UN'), # only when using unbundle on ssh and old http servers
644 644 # since the unification ssh accepts a header but there
645 645 # is no capability signaling it.
646 646 b"HG20": (), # special-cased below
647 647 b"HG10UN": (b"HG10UN", b'UN'),
648 648 b"HG10BZ": (b"HG10", b'BZ'),
649 649 b"HG10GZ": (b"HG10GZ", b'GZ'),
650 650 }
651 651
652 652 # hgweb uses this list to communicate its preferred type
653 653 bundlepriority = [b'HG10GZ', b'HG10BZ', b'HG10UN']
654 654
655 655
656 656 class bundle20:
657 657 """represent an outgoing bundle2 container
658 658
659 659 Use the `addparam` method to add stream level parameters and `newpart` to
660 660 populate it. Then call `getchunks` to retrieve all the binary chunks of
661 661 data that compose the bundle2 container."""
662 662
663 663 _magicstring = b'HG20'
664 664
665 665 def __init__(self, ui, capabilities=()):
666 666 self.ui = ui
667 667 self._params = []
668 668 self._parts = []
669 669 self.capabilities = dict(capabilities)
670 670 self._compengine = util.compengines.forbundletype(b'UN')
671 671 self._compopts = None
672 672 # If compression is being handled by a consumer of the raw
673 673 # data (e.g. the wire protocol), unsetting this flag tells
674 674 # consumers that the bundle is best left uncompressed.
675 675 self.prefercompressed = True
676 676
677 677 def setcompression(self, alg, compopts=None):
678 678 """setup core part compression to <alg>"""
679 679 if alg in (None, b'UN'):
680 680 return
681 681 assert not any(n.lower() == b'compression' for n, v in self._params)
682 682 self.addparam(b'Compression', alg)
683 683 self._compengine = util.compengines.forbundletype(alg)
684 684 self._compopts = compopts
685 685
686 686 @property
687 687 def nbparts(self):
688 688 """total number of parts added to the bundler"""
689 689 return len(self._parts)
690 690
691 691 # methods used to define the bundle2 content
692 692 def addparam(self, name, value=None):
693 693 """add a stream level parameter"""
694 694 if not name:
695 695 raise error.ProgrammingError(b'empty parameter name')
696 696 if name[0:1] not in pycompat.bytestr(
697 697 string.ascii_letters # pytype: disable=wrong-arg-types
698 698 ):
699 699 raise error.ProgrammingError(
700 700 b'non letter first character: %s' % name
701 701 )
702 702 self._params.append((name, value))
703 703
704 704 def addpart(self, part):
705 705 """add a new part to the bundle2 container
706 706
707 707 Parts contain the actual application payload."""
708 708 assert part.id is None
709 709 part.id = len(self._parts) # very cheap counter
710 710 self._parts.append(part)
711 711
712 712 def newpart(self, typeid, *args, **kwargs):
713 713 """create a new part and add it to the container
714 714
715 715 The part is directly added to the container. For now, this means
716 716 that any failure to properly initialize the part after calling
717 717 ``newpart`` should result in a failure of the whole bundling process.
718 718
719 719 You can still fall back to manually creating and adding a part if you
720 720 need better control."""
721 721 part = bundlepart(typeid, *args, **kwargs)
722 722 self.addpart(part)
723 723 return part
724 724
725 725 # methods used to generate the bundle2 stream
726 726 def getchunks(self):
727 727 if self.ui.debugflag:
728 728 msg = [b'bundle2-output-bundle: "%s",' % self._magicstring]
729 729 if self._params:
730 730 msg.append(b' (%i params)' % len(self._params))
731 731 msg.append(b' %i parts total\n' % len(self._parts))
732 732 self.ui.debug(b''.join(msg))
733 733 outdebug(self.ui, b'start emission of %s stream' % self._magicstring)
734 734 yield self._magicstring
735 735 param = self._paramchunk()
736 736 outdebug(self.ui, b'bundle parameter: %s' % param)
737 737 yield _pack(_fstreamparamsize, len(param))
738 738 if param:
739 739 yield param
740 740 for chunk in self._compengine.compressstream(
741 741 self._getcorechunk(), self._compopts
742 742 ):
743 743 yield chunk
744 744
745 745 def _paramchunk(self):
746 746 """return an encoded version of all stream parameters"""
747 747 blocks = []
748 748 for par, value in self._params:
749 749 par = urlreq.quote(par)
750 750 if value is not None:
751 751 value = urlreq.quote(value)
752 752 par = b'%s=%s' % (par, value)
753 753 blocks.append(par)
754 754 return b' '.join(blocks)
755 755
756 756 def _getcorechunk(self):
757 757 """yield chunk for the core part of the bundle
758 758
759 759 (all but headers and parameters)"""
760 760 outdebug(self.ui, b'start of parts')
761 761 for part in self._parts:
762 762 outdebug(self.ui, b'bundle part: "%s"' % part.type)
763 763 for chunk in part.getchunks(ui=self.ui):
764 764 yield chunk
765 765 outdebug(self.ui, b'end of bundle')
766 766 yield _pack(_fpartheadersize, 0)
767 767
768 768 def salvageoutput(self):
769 769 """return a list with a copy of all output parts in the bundle
770 770
771 771 This is meant to be used during error handling to make sure we preserve
772 772 server output"""
773 773 salvaged = []
774 774 for part in self._parts:
775 775 if part.type.startswith(b'output'):
776 776 salvaged.append(part.copy())
777 777 return salvaged
778 778
779 779
780 780 class unpackermixin:
781 781 """A mixin to extract bytes and struct data from a stream"""
782 782
783 783 def __init__(self, fp):
784 784 self._fp = fp
785 785
786 786 def _unpack(self, format):
787 787 """unpack this struct format from the stream
788 788
789 789 This method is meant for internal usage by the bundle2 protocol only.
790 790 It directly manipulates the low level stream, including bundle2 level
791 791 instructions.
792 792
793 793 Do not use it to implement higher-level logic or methods."""
794 794 data = self._readexact(struct.calcsize(format))
795 795 return _unpack(format, data)
796 796
797 797 def _readexact(self, size):
798 798 """read exactly <size> bytes from the stream
799 799
800 800 This method is meant for internal usage by the bundle2 protocol only.
801 801 It directly manipulates the low level stream, including bundle2 level
802 802 instructions.
803 803
804 804 Do not use it to implement higher-level logic or methods."""
805 805 return changegroup.readexactly(self._fp, size)
806 806
807 807
808 808 def getunbundler(ui, fp, magicstring=None):
809 809 """return a valid unbundler object for a given magicstring"""
810 810 if magicstring is None:
811 811 magicstring = changegroup.readexactly(fp, 4)
812 812 magic, version = magicstring[0:2], magicstring[2:4]
813 813 if magic != b'HG':
814 814 ui.debug(
815 815 b"error: invalid magic: %r (version %r), should be 'HG'\n"
816 816 % (magic, version)
817 817 )
818 818 raise error.Abort(_(b'not a Mercurial bundle'))
819 819 unbundlerclass = formatmap.get(version)
820 820 if unbundlerclass is None:
821 821 raise error.Abort(_(b'unknown bundle version %s') % version)
822 822 unbundler = unbundlerclass(ui, fp)
823 823 indebug(ui, b'start processing of %s stream' % magicstring)
824 824 return unbundler
825 825
826 826
827 827 class unbundle20(unpackermixin):
828 828 """interpret a bundle2 stream
829 829
830 830 This class is fed with a binary stream and yields parts through its
831 831 `iterparts` method."""
832 832
833 833 _magicstring = b'HG20'
834 834
835 835 def __init__(self, ui, fp):
836 836 """If header is specified, we do not read it out of the stream."""
837 837 self.ui = ui
838 838 self._compengine = util.compengines.forbundletype(b'UN')
839 839 self._compressed = None
840 840 super(unbundle20, self).__init__(fp)
841 841
842 842 @util.propertycache
843 843 def params(self):
844 844 """dictionary of stream level parameters"""
845 845 indebug(self.ui, b'reading bundle2 stream parameters')
846 846 params = {}
847 847 paramssize = self._unpack(_fstreamparamsize)[0]
848 848 if paramssize < 0:
849 849 raise error.BundleValueError(
850 850 b'negative bundle param size: %i' % paramssize
851 851 )
852 852 if paramssize:
853 853 params = self._readexact(paramssize)
854 854 params = self._processallparams(params)
855 855 return params
856 856
857 857 def _processallparams(self, paramsblock):
858 858 """parse the raw parameter block and process each stream level parameter"""
859 859 params = util.sortdict()
860 860 for p in paramsblock.split(b' '):
861 861 p = p.split(b'=', 1)
862 862 p = [urlreq.unquote(i) for i in p]
863 863 if len(p) < 2:
864 864 p.append(None)
865 865 self._processparam(*p)
866 866 params[p[0]] = p[1]
867 867 return params
868 868
869 869 def _processparam(self, name, value):
870 870 """process a parameter, applying its effect if needed
871 871
872 872 Parameters starting with a lower case letter are advisory and will be
873 873 ignored when unknown. Those starting with an upper case letter are
874 874 mandatory, and this function raises BundleUnknownFeatureError when unknown.
875 875
876 876 Note: no options are currently supported. Any input will either be
877 877 ignored or fail.
878 878 """
879 879 if not name:
880 880 raise ValueError('empty parameter name')
881 881 if name[0:1] not in pycompat.bytestr(
882 882 string.ascii_letters # pytype: disable=wrong-arg-types
883 883 ):
884 884 raise ValueError('non letter first character: %s' % name)
885 885 try:
886 886 handler = b2streamparamsmap[name.lower()]
887 887 except KeyError:
888 888 if name[0:1].islower():
889 889 indebug(self.ui, b"ignoring unknown parameter %s" % name)
890 890 else:
891 891 raise error.BundleUnknownFeatureError(params=(name,))
892 892 else:
893 893 handler(self, name, value)
894 894
895 895 def _forwardchunks(self):
896 896 """utility to transfer a bundle2 as binary
897 897
898 898 This is made necessary by the fact that the 'getbundle' command over 'ssh'
899 899 has no way to know when the reply ends, relying on the bundle to be
900 900 interpreted to know its end. This is terrible and we are sorry, but we
901 901 needed to move forward to get general delta enabled.
902 902 """
903 903 yield self._magicstring
904 904 assert 'params' not in vars(self)
905 905 paramssize = self._unpack(_fstreamparamsize)[0]
906 906 if paramssize < 0:
907 907 raise error.BundleValueError(
908 908 b'negative bundle param size: %i' % paramssize
909 909 )
910 910 if paramssize:
911 911 params = self._readexact(paramssize)
912 912 self._processallparams(params)
913 913 # The payload itself is decompressed below, so drop
914 914 # the compression parameter passed down to compensate.
915 915 outparams = []
916 916 for p in params.split(b' '):
917 917 k, v = p.split(b'=', 1)
918 918 if k.lower() != b'compression':
919 919 outparams.append(p)
920 920 outparams = b' '.join(outparams)
921 921 yield _pack(_fstreamparamsize, len(outparams))
922 922 yield outparams
923 923 else:
924 924 yield _pack(_fstreamparamsize, paramssize)
925 925 # From there, payload might need to be decompressed
926 926 self._fp = self._compengine.decompressorreader(self._fp)
927 927 emptycount = 0
928 928 while emptycount < 2:
929 929 # so we can brainlessly loop
930 930 assert _fpartheadersize == _fpayloadsize
931 931 size = self._unpack(_fpartheadersize)[0]
932 932 yield _pack(_fpartheadersize, size)
933 933 if size:
934 934 emptycount = 0
935 935 else:
936 936 emptycount += 1
937 937 continue
938 938 if size == flaginterrupt:
939 939 continue
940 940 elif size < 0:
941 941 raise error.BundleValueError(b'negative chunk size: %i' % size)
942 942 yield self._readexact(size)
943 943
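The parameter rewrite done in `_forwardchunks` above (forwarding every stream parameter except the compression one) can be sketched as a standalone helper. `drop_compression` is a hypothetical name; unlike the loop above, this sketch uses `partition` so that a parameter without a `=value` part is also accepted.

```python
def drop_compression(params):
    """Return the space separated `params` blob without the compression entry."""
    kept = []
    for p in params.split(b' '):
        # partition() tolerates entries that carry no '=value' part
        name, _sep, _value = p.partition(b'=')
        if name.lower() != b'compression':
            kept.append(p)
    return b' '.join(kept)
```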
944 944 def iterparts(self, seekable=False):
945 945 """yield all parts contained in the stream"""
946 946 cls = seekableunbundlepart if seekable else unbundlepart
947 947 # make sure params have been loaded
948 948 self.params
949 949 # From there, payload needs to be decompressed
950 950 self._fp = self._compengine.decompressorreader(self._fp)
951 951 indebug(self.ui, b'start extraction of bundle2 parts')
952 952 headerblock = self._readpartheader()
953 953 while headerblock is not None:
954 954 part = cls(self.ui, headerblock, self._fp)
955 955 yield part
956 956 # Ensure part is fully consumed so we can start reading the next
957 957 # part.
958 958 part.consume()
959 959
960 960 headerblock = self._readpartheader()
961 961 indebug(self.ui, b'end of bundle2 stream')
962 962
963 963 def _readpartheader(self):
964 964 """read a part header size and return the header as a bytes blob
965 965
966 966 returns None if empty"""
967 967 headersize = self._unpack(_fpartheadersize)[0]
968 968 if headersize < 0:
969 969 raise error.BundleValueError(
970 970 b'negative part header size: %i' % headersize
971 971 )
972 972 indebug(self.ui, b'part header size: %i' % headersize)
973 973 if headersize:
974 974 return self._readexact(headersize)
975 975 return None
976 976
977 977 def compressed(self):
978 978 self.params # load params
979 979 return self._compressed
980 980
981 981 def close(self):
982 982 """close underlying file"""
983 983 if util.safehasattr(self._fp, 'close'):
984 984 return self._fp.close()
985 985
986 986
987 987 formatmap = {b'20': unbundle20}
988 988
989 989 b2streamparamsmap = {}
990 990
991 991
992 992 def b2streamparamhandler(name):
993 993 """register a handler for a stream level parameter"""
994 994
995 995 def decorator(func):
996 996 assert name not in b2streamparamsmap
997 997 b2streamparamsmap[name] = func
998 998 return func
999 999
1000 1000 return decorator
1001 1001
1002 1002
1003 1003 @b2streamparamhandler(b'compression')
1004 1004 def processcompression(unbundler, param, value):
1005 1005 """read compression parameter and install payload decompression"""
1006 1006 if value not in util.compengines.supportedbundletypes:
1007 1007 raise error.BundleUnknownFeatureError(params=(param,), values=(value,))
1008 1008 unbundler._compengine = util.compengines.forbundletype(value)
1009 1009 if value is not None:
1010 1010 unbundler._compressed = True
1011 1011
1012 1012
1013 1013 class bundlepart:
1014 1014 """A bundle2 part contains application level payload
1015 1015
1016 1016 The part `type` is used to route the part to the application level
1017 1017 handler.
1018 1018
1019 1019 The part payload is contained in ``part.data``. It could be raw bytes or a
1020 1020 generator of byte chunks.
1021 1021
1022 1022 You can add parameters to the part using the ``addparam`` method.
1023 1023 Parameters can be either mandatory (default) or advisory. Remote side
1024 1024 should be able to safely ignore the advisory ones.
1025 1025
1026 1026 Both data and parameters cannot be modified after the generation has begun.
1027 1027 """
1028 1028
1029 1029 def __init__(
1030 1030 self,
1031 1031 parttype,
1032 1032 mandatoryparams=(),
1033 1033 advisoryparams=(),
1034 1034 data=b'',
1035 1035 mandatory=True,
1036 1036 ):
1037 1037 validateparttype(parttype)
1038 1038 self.id = None
1039 1039 self.type = parttype
1040 1040 self._data = data
1041 1041 self._mandatoryparams = list(mandatoryparams)
1042 1042 self._advisoryparams = list(advisoryparams)
1043 1043 # checking for duplicated entries
1044 1044 self._seenparams = set()
1045 1045 for pname, __ in self._mandatoryparams + self._advisoryparams:
1046 1046 if pname in self._seenparams:
1047 1047 raise error.ProgrammingError(b'duplicated params: %s' % pname)
1048 1048 self._seenparams.add(pname)
1049 1049 # status of the part's generation:
1050 1050 # - None: not started,
1051 1051 # - False: currently generated,
1052 1052 # - True: generation done.
1053 1053 self._generated = None
1054 1054 self.mandatory = mandatory
1055 1055
1056 1056 def __repr__(self):
1057 1057 cls = "%s.%s" % (self.__class__.__module__, self.__class__.__name__)
1058 1058 return '<%s object at %x; id: %s; type: %s; mandatory: %s>' % (
1059 1059 cls,
1060 1060 id(self),
1061 1061 self.id,
1062 1062 self.type,
1063 1063 self.mandatory,
1064 1064 )
1065 1065
1066 1066 def copy(self):
1067 1067 """return a copy of the part
1068 1068
1069 1069 The new part has the very same content but no partid assigned yet.
1070 1070 Parts with generated data cannot be copied."""
1071 1071 assert not util.safehasattr(self.data, 'next')
1072 1072 return self.__class__(
1073 1073 self.type,
1074 1074 self._mandatoryparams,
1075 1075 self._advisoryparams,
1076 1076 self._data,
1077 1077 self.mandatory,
1078 1078 )
1079 1079
1080 1080 # methods used to define the part content
1081 1081 @property
1082 1082 def data(self):
1083 1083 return self._data
1084 1084
1085 1085 @data.setter
1086 1086 def data(self, data):
1087 1087 if self._generated is not None:
1088 1088 raise error.ReadOnlyPartError(b'part is being generated')
1089 1089 self._data = data
1090 1090
1091 1091 @property
1092 1092 def mandatoryparams(self):
1093 1093 # make it an immutable tuple to force people through ``addparam``
1094 1094 return tuple(self._mandatoryparams)
1095 1095
1096 1096 @property
1097 1097 def advisoryparams(self):
1098 1098 # make it an immutable tuple to force people through ``addparam``
1099 1099 return tuple(self._advisoryparams)
1100 1100
1101 1101 def addparam(self, name, value=b'', mandatory=True):
1102 1102 """add a parameter to the part
1103 1103
1104 1104 If 'mandatory' is set to True, the remote handler must claim support
1105 1105 for this parameter or the unbundling will be aborted.
1106 1106
1107 1107 The 'name' and 'value' cannot exceed 255 bytes each.
1108 1108 """
1109 1109 if self._generated is not None:
1110 1110 raise error.ReadOnlyPartError(b'part is being generated')
1111 1111 if name in self._seenparams:
1112 1112 raise ValueError(b'duplicated params: %s' % name)
1113 1113 self._seenparams.add(name)
1114 1114 params = self._advisoryparams
1115 1115 if mandatory:
1116 1116 params = self._mandatoryparams
1117 1117 params.append((name, value))
1118 1118
1119 1119 # methods used to generate the bundle2 stream
1120 1120 def getchunks(self, ui):
1121 1121 if self._generated is not None:
1122 1122 raise error.ProgrammingError(b'part can only be consumed once')
1123 1123 self._generated = False
1124 1124
1125 1125 if ui.debugflag:
1126 1126 msg = [b'bundle2-output-part: "%s"' % self.type]
1127 1127 if not self.mandatory:
1128 1128 msg.append(b' (advisory)')
1129 1129 nbmp = len(self.mandatoryparams)
1130 1130 nbap = len(self.advisoryparams)
1131 1131 if nbmp or nbap:
1132 1132 msg.append(b' (params:')
1133 1133 if nbmp:
1134 1134 msg.append(b' %i mandatory' % nbmp)
1135 1135 if nbap:
1136 1136 msg.append(b' %i advisory' % nbap)
1137 1137 msg.append(b')')
1138 1138 if not self.data:
1139 1139 msg.append(b' empty payload')
1140 1140 elif util.safehasattr(self.data, 'next') or util.safehasattr(
1141 self.data, b'__next__'
1141 self.data, '__next__'
1142 1142 ):
1143 1143 msg.append(b' streamed payload')
1144 1144 else:
1145 1145 msg.append(b' %i bytes payload' % len(self.data))
1146 1146 msg.append(b'\n')
1147 1147 ui.debug(b''.join(msg))
1148 1148
1149 1149 #### header
1150 1150 if self.mandatory:
1151 1151 parttype = self.type.upper()
1152 1152 else:
1153 1153 parttype = self.type.lower()
1154 1154 outdebug(ui, b'part %s: "%s"' % (pycompat.bytestr(self.id), parttype))
1155 1155 ## parttype
1156 1156 header = [
1157 1157 _pack(_fparttypesize, len(parttype)),
1158 1158 parttype,
1159 1159 _pack(_fpartid, self.id),
1160 1160 ]
1161 1161 ## parameters
1162 1162 # count
1163 1163 manpar = self.mandatoryparams
1164 1164 advpar = self.advisoryparams
1165 1165 header.append(_pack(_fpartparamcount, len(manpar), len(advpar)))
1166 1166 # size
1167 1167 parsizes = []
1168 1168 for key, value in manpar:
1169 1169 parsizes.append(len(key))
1170 1170 parsizes.append(len(value))
1171 1171 for key, value in advpar:
1172 1172 parsizes.append(len(key))
1173 1173 parsizes.append(len(value))
1174 1174 paramsizes = _pack(_makefpartparamsizes(len(parsizes) // 2), *parsizes)
1175 1175 header.append(paramsizes)
1176 1176 # key, value
1177 1177 for key, value in manpar:
1178 1178 header.append(key)
1179 1179 header.append(value)
1180 1180 for key, value in advpar:
1181 1181 header.append(key)
1182 1182 header.append(value)
1183 1183 ## finalize header
1184 1184 try:
1185 1185 headerchunk = b''.join(header)
1186 1186 except TypeError:
1187 1187 raise TypeError(
1188 1188 'Found a non-bytes trying to '
1189 1189 'build bundle part header: %r' % header
1190 1190 )
1191 1191 outdebug(ui, b'header chunk size: %i' % len(headerchunk))
1192 1192 yield _pack(_fpartheadersize, len(headerchunk))
1193 1193 yield headerchunk
1194 1194 ## payload
1195 1195 try:
1196 1196 for chunk in self._payloadchunks():
1197 1197 outdebug(ui, b'payload chunk size: %i' % len(chunk))
1198 1198 yield _pack(_fpayloadsize, len(chunk))
1199 1199 yield chunk
1200 1200 except GeneratorExit:
1201 1201 # GeneratorExit means that nobody is listening for our
1202 1202 # results anyway, so just bail quickly rather than trying
1203 1203 # to produce an error part.
1204 1204 ui.debug(b'bundle2-generatorexit\n')
1205 1205 raise
1206 1206 except BaseException as exc:
1207 1207 bexc = stringutil.forcebytestr(exc)
1208 1208 # backup exception data for later
1209 1209 ui.debug(
1210 1210 b'bundle2-input-stream-interrupt: encoding exception %s' % bexc
1211 1211 )
1212 1212 tb = sys.exc_info()[2]
1213 1213 msg = b'unexpected error: %s' % bexc
1214 1214 interpart = bundlepart(
1215 1215 b'error:abort', [(b'message', msg)], mandatory=False
1216 1216 )
1217 1217 interpart.id = 0
1218 1218 yield _pack(_fpayloadsize, -1)
1219 1219 for chunk in interpart.getchunks(ui=ui):
1220 1220 yield chunk
1221 1221 outdebug(ui, b'closing payload chunk')
1222 1222 # abort current part payload
1223 1223 yield _pack(_fpayloadsize, 0)
1224 1224 pycompat.raisewithtb(exc, tb)
1225 1225 # end of payload
1226 1226 outdebug(ui, b'closing payload chunk')
1227 1227 yield _pack(_fpayloadsize, 0)
1228 1228 self._generated = True
1229 1229
1230 1230 def _payloadchunks(self):
1231 1231 """yield chunks of the part payload
1232 1232
1233 1233 Exists to handle the different methods to provide data to a part."""
1234 1234 # we only support fixed size data now.
1235 1235 # This will be improved in the future.
1236 1236 if util.safehasattr(self.data, 'next') or util.safehasattr(
1237 1237 self.data, '__next__'
1238 1238 ):
1239 1239 buff = util.chunkbuffer(self.data)
1240 1240 chunk = buff.read(preferedchunksize)
1241 1241 while chunk:
1242 1242 yield chunk
1243 1243 chunk = buff.read(preferedchunksize)
1244 1244 elif len(self.data):
1245 1245 yield self.data
1246 1246
1247 1247
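The payload framing emitted by `getchunks` above (a size prefix before each chunk, then a zero size to close the payload) can be sketched as follows. This is a sketch under assumptions: `frame_payload` is a hypothetical helper, and `'>i'` is assumed to be the value of `_fpayloadsize`, which is defined outside this section.

```python
import struct

PAYLOAD_SIZE_FMT = '>i'  # assumed value of _fpayloadsize: signed big-endian int32


def frame_payload(chunks):
    """Yield the framed byte stream for an iterable of bytes chunks."""
    for chunk in chunks:
        if chunk:  # a zero size would be read as the end marker
            yield struct.pack(PAYLOAD_SIZE_FMT, len(chunk))
            yield chunk
    # a zero-size chunk closes the payload
    yield struct.pack(PAYLOAD_SIZE_FMT, 0)
```

A signed size field is what leaves room for the negative interrupt flag (`flaginterrupt`) used when an error part has to be sent in the middle of a payload.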
1248 1248 flaginterrupt = -1
1249 1249
1250 1250
1251 1251 class interrupthandler(unpackermixin):
1252 1252 """read one part and process it with restricted capability
1253 1253
1254 1254 This allows an exception raised on the producer side during part
1255 1255 iteration to be transmitted while the consumer is reading a part.
1256 1256
1257 1257 Parts processed in this manner only have access to a ui object."""
1258 1258
1259 1259 def __init__(self, ui, fp):
1260 1260 super(interrupthandler, self).__init__(fp)
1261 1261 self.ui = ui
1262 1262
1263 1263 def _readpartheader(self):
1264 1264 """read a part header size and return the header as a bytes blob
1265 1265
1266 1266 returns None if empty"""
1267 1267 headersize = self._unpack(_fpartheadersize)[0]
1268 1268 if headersize < 0:
1269 1269 raise error.BundleValueError(
1270 1270 b'negative part header size: %i' % headersize
1271 1271 )
1272 1272 indebug(self.ui, b'part header size: %i\n' % headersize)
1273 1273 if headersize:
1274 1274 return self._readexact(headersize)
1275 1275 return None
1276 1276
1277 1277 def __call__(self):
1278 1278
1279 1279 self.ui.debug(
1280 1280 b'bundle2-input-stream-interrupt: opening out of band context\n'
1281 1281 )
1282 1282 indebug(self.ui, b'bundle2 stream interruption, looking for a part.')
1283 1283 headerblock = self._readpartheader()
1284 1284 if headerblock is None:
1285 1285 indebug(self.ui, b'no part found during interruption.')
1286 1286 return
1287 1287 part = unbundlepart(self.ui, headerblock, self._fp)
1288 1288 op = interruptoperation(self.ui)
1289 1289 hardabort = False
1290 1290 try:
1291 1291 _processpart(op, part)
1292 1292 except (SystemExit, KeyboardInterrupt):
1293 1293 hardabort = True
1294 1294 raise
1295 1295 finally:
1296 1296 if not hardabort:
1297 1297 part.consume()
1298 1298 self.ui.debug(
1299 1299 b'bundle2-input-stream-interrupt: closing out of band context\n'
1300 1300 )
1301 1301
1302 1302
1303 1303 class interruptoperation:
1304 1304 """A limited operation to be used by part handlers during interruption
1305 1305
1306 1306 It only has access to a ui object.
1307 1307 """
1308 1308
1309 1309 def __init__(self, ui):
1310 1310 self.ui = ui
1311 1311 self.reply = None
1312 1312 self.captureoutput = False
1313 1313
1314 1314 @property
1315 1315 def repo(self):
1316 1316 raise error.ProgrammingError(b'no repo access from stream interruption')
1317 1317
1318 1318 def gettransaction(self):
1319 1319 raise TransactionUnavailable(b'no repo access from stream interruption')
1320 1320
1321 1321
1322 1322 def decodepayloadchunks(ui, fh):
1323 1323 """Reads bundle2 part payload data into chunks.
1324 1324
1325 1325 Part payload data consists of framed chunks. This function takes
1326 1326 a file handle and emits those chunks.
1327 1327 """
1328 1328 dolog = ui.configbool(b'devel', b'bundle2.debug')
1329 1329 debug = ui.debug
1330 1330
1331 1331 headerstruct = struct.Struct(_fpayloadsize)
1332 1332 headersize = headerstruct.size
1333 1333 unpack = headerstruct.unpack
1334 1334
1335 1335 readexactly = changegroup.readexactly
1336 1336 read = fh.read
1337 1337
1338 1338 chunksize = unpack(readexactly(fh, headersize))[0]
1339 1339 indebug(ui, b'payload chunk size: %i' % chunksize)
1340 1340
1341 1341 # changegroup.readexactly() is inlined below for performance.
1342 1342 while chunksize:
1343 1343 if chunksize >= 0:
1344 1344 s = read(chunksize)
1345 1345 if len(s) < chunksize:
1346 1346 raise error.Abort(
1347 1347 _(
1348 1348 b'stream ended unexpectedly '
1349 1349 b'(got %d bytes, expected %d)'
1350 1350 )
1351 1351 % (len(s), chunksize)
1352 1352 )
1353 1353
1354 1354 yield s
1355 1355 elif chunksize == flaginterrupt:
1356 1356 # Interrupt "signal" detected. The regular stream is interrupted
1357 1357 # and a bundle2 part follows. Consume it.
1358 1358 interrupthandler(ui, fh)()
1359 1359 else:
1360 1360 raise error.BundleValueError(
1361 1361 b'negative payload chunk size: %i' % chunksize
1362 1362 )
1363 1363
1364 1364 s = read(headersize)
1365 1365 if len(s) < headersize:
1366 1366 raise error.Abort(
1367 1367 _(b'stream ended unexpectedly (got %d bytes, expected %d)')
1368 1368 % (len(s), headersize)
1369 1369 )
1370 1370
1371 1371 chunksize = unpack(s)[0]
1372 1372
1373 1373 # indebug() inlined for performance.
1374 1374 if dolog:
1375 1375 debug(b'bundle2-input: payload chunk size: %i\n' % chunksize)
1376 1376
1377 1377
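The read loop of `decodepayloadchunks` above can be sketched without the Mercurial helpers. This is a simplified sketch under assumptions: `read_frames` and `on_interrupt` are hypothetical names, `'>i'` is assumed to be the value of `_fpayloadsize`, and the interrupt branch only calls a callback where the real code processes an out-of-band part.

```python
import io
import struct

PAYLOAD_SIZE = struct.Struct('>i')  # assumed value of _fpayloadsize
FLAG_INTERRUPT = -1


def read_frames(fh, on_interrupt=None):
    """Yield payload chunks from file-like `fh` until the zero-size marker."""
    while True:
        header = fh.read(PAYLOAD_SIZE.size)
        if len(header) < PAYLOAD_SIZE.size:
            raise EOFError('stream ended inside a chunk header')
        (size,) = PAYLOAD_SIZE.unpack(header)
        if size == 0:
            return
        if size == FLAG_INTERRUPT:
            # The real code reads an out-of-band part here; this sketch
            # only notifies a hypothetical callback.
            if on_interrupt is not None:
                on_interrupt(fh)
            continue
        if size < 0:
            raise ValueError('negative payload chunk size: %d' % size)
        data = fh.read(size)
        if len(data) < size:
            raise EOFError('got %d bytes, expected %d' % (len(data), size))
        yield data
```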
1378 1378 class unbundlepart(unpackermixin):
1379 1379 """a bundle part read from a bundle"""
1380 1380
1381 1381 def __init__(self, ui, header, fp):
1382 1382 super(unbundlepart, self).__init__(fp)
1383 1383 self._seekable = util.safehasattr(fp, 'seek') and util.safehasattr(
1384 1384 fp, 'tell'
1385 1385 )
1386 1386 self.ui = ui
1387 1387 # unbundle state attr
1388 1388 self._headerdata = header
1389 1389 self._headeroffset = 0
1390 1390 self._initialized = False
1391 1391 self.consumed = False
1392 1392 # part data
1393 1393 self.id = None
1394 1394 self.type = None
1395 1395 self.mandatoryparams = None
1396 1396 self.advisoryparams = None
1397 1397 self.params = None
1398 1398 self.mandatorykeys = ()
1399 1399 self._readheader()
1400 1400 self._mandatory = None
1401 1401 self._pos = 0
1402 1402
1403 1403 def _fromheader(self, size):
1404 1404 """return the next <size> bytes from the header"""
1405 1405 offset = self._headeroffset
1406 1406 data = self._headerdata[offset : (offset + size)]
1407 1407 self._headeroffset = offset + size
1408 1408 return data
1409 1409
1410 1410 def _unpackheader(self, format):
1411 1411 """read given format from header
1412 1412
1413 1413 This automatically computes the size of the format to read."""
1414 1414 data = self._fromheader(struct.calcsize(format))
1415 1415 return _unpack(format, data)
1416 1416
1417 1417 def _initparams(self, mandatoryparams, advisoryparams):
1418 1418 """internal function to set up all parameter-related logic"""
1419 1419 # make it read only to prevent people touching it by mistake.
1420 1420 self.mandatoryparams = tuple(mandatoryparams)
1421 1421 self.advisoryparams = tuple(advisoryparams)
1422 1422 # user friendly UI
1423 1423 self.params = util.sortdict(self.mandatoryparams)
1424 1424 self.params.update(self.advisoryparams)
1425 1425 self.mandatorykeys = frozenset(p[0] for p in mandatoryparams)
1426 1426
1427 1427 def _readheader(self):
1428 1428 """read the header and set up the object"""
1429 1429 typesize = self._unpackheader(_fparttypesize)[0]
1430 1430 self.type = self._fromheader(typesize)
1431 1431 indebug(self.ui, b'part type: "%s"' % self.type)
1432 1432 self.id = self._unpackheader(_fpartid)[0]
1433 1433 indebug(self.ui, b'part id: "%s"' % pycompat.bytestr(self.id))
1434 1434 # extract mandatory bit from type
1435 1435 self.mandatory = self.type != self.type.lower()
1436 1436 self.type = self.type.lower()
1437 1437 ## reading parameters
1438 1438 # param count
1439 1439 mancount, advcount = self._unpackheader(_fpartparamcount)
1440 1440 indebug(self.ui, b'part parameters: %i' % (mancount + advcount))
1441 1441 # param size
1442 1442 fparamsizes = _makefpartparamsizes(mancount + advcount)
1443 1443 paramsizes = self._unpackheader(fparamsizes)
1444 1444 # turn the flat sequence back into a list of (key size, value size) pairs
1445 1445 paramsizes = list(zip(paramsizes[::2], paramsizes[1::2]))
1446 1446 # split mandatory from advisory
1447 1447 mansizes = paramsizes[:mancount]
1448 1448 advsizes = paramsizes[mancount:]
1449 1449 # retrieve param value
1450 1450 manparams = []
1451 1451 for key, value in mansizes:
1452 1452 manparams.append((self._fromheader(key), self._fromheader(value)))
1453 1453 advparams = []
1454 1454 for key, value in advsizes:
1455 1455 advparams.append((self._fromheader(key), self._fromheader(value)))
1456 1456 self._initparams(manparams, advparams)
1457 1457 ## part payload
1458 1458 self._payloadstream = util.chunkbuffer(self._payloadchunks())
1459 1459 # the header has been read; record it
1460 1460 self._initialized = True
1461 1461
1462 1462 def _payloadchunks(self):
1463 1463 """Generator of decoded chunks in the payload."""
1464 1464 return decodepayloadchunks(self.ui, self._fp)
1465 1465
1466 1466 def consume(self):
1467 1467 """Read the part payload until completion.
1468 1468
1469 1469 By consuming the part data, the underlying stream read offset will
1470 1470 be advanced to the next part (or end of stream).
1471 1471 """
1472 1472 if self.consumed:
1473 1473 return
1474 1474
1475 1475 chunk = self.read(32768)
1476 1476 while chunk:
1477 1477 self._pos += len(chunk)
1478 1478 chunk = self.read(32768)
1479 1479
1480 1480 def read(self, size=None):
1481 1481 """read payload data"""
1482 1482 if not self._initialized:
1483 1483 self._readheader()
1484 1484 if size is None:
1485 1485 data = self._payloadstream.read()
1486 1486 else:
1487 1487 data = self._payloadstream.read(size)
1488 1488 self._pos += len(data)
1489 1489 if size is None or len(data) < size:
1490 1490 if not self.consumed and self._pos:
1491 1491 self.ui.debug(
1492 1492 b'bundle2-input-part: total payload size %i\n' % self._pos
1493 1493 )
1494 1494 self.consumed = True
1495 1495 return data
1496 1496
1497 1497
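The header layout written by `bundlepart.getchunks` and read back by `unbundlepart._readheader` can be illustrated with a round trip. This is a sketch under assumptions: the struct formats (`'>B'` for the type size, `'>I'` for the part id, `'>BB'` for the parameter counts and one `'BB'` pair per parameter) are assumed values of the `_fpart*` constants defined outside this section, and both function names are hypothetical.

```python
import struct


def pack_part_header(parttype, partid, mandatory, advisory):
    """Build a part header from bytes `parttype` and (key, value) pairs."""
    params = list(mandatory) + list(advisory)
    sizes = [len(item) for pair in params for item in pair]
    out = [
        struct.pack('>B', len(parttype)),
        parttype,
        struct.pack('>I', partid),
        struct.pack('>BB', len(mandatory), len(advisory)),
        struct.pack('>' + 'BB' * len(params), *sizes),
    ]
    for key, value in params:
        out.extend((key, value))
    return b''.join(out)


def parse_part_header(blob):
    """Decode a header produced by pack_part_header into a dict."""
    offset = 0

    def take(n):
        nonlocal offset
        data = blob[offset:offset + n]
        offset += n
        return data

    (typesize,) = struct.unpack('>B', take(1))
    parttype = take(typesize)
    (partid,) = struct.unpack('>I', take(4))
    mancount, advcount = struct.unpack('>BB', take(2))
    total = mancount + advcount
    flat = struct.unpack('>' + 'BB' * total, take(2 * total))
    sizes = list(zip(flat[::2], flat[1::2]))
    params = [(take(k), take(v)) for k, v in sizes]
    return {
        'type': parttype.lower(),
        # an upper case type marks a mandatory part
        'mandatory': parttype != parttype.lower(),
        'id': partid,
        'mandatoryparams': params[:mancount],
        'advisoryparams': params[mancount:],
    }
```

One-byte size fields would be consistent with the 255 byte limit on parameter names and values stated in the `addparam` docstring.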
1498 1498 class seekableunbundlepart(unbundlepart):
1499 1499 """A bundle2 part in a bundle that is seekable.
1500 1500
1501 1501 Regular ``unbundlepart`` instances can only be read once. This class
1502 1502 extends ``unbundlepart`` to enable bi-directional seeking within the
1503 1503 part.
1504 1504
1505 1505 Bundle2 part data consists of framed chunks. Offsets when seeking
1506 1506 refer to the decoded data, not the offsets in the underlying bundle2
1507 1507 stream.
1508 1508
1509 1509 To facilitate quickly seeking within the decoded data, instances of this
1510 1510 class maintain a mapping between offsets in the underlying stream and
1511 1511 the decoded payload. This mapping will consume memory in proportion
1512 1512 to the number of chunks within the payload (which almost certainly
1513 1513 increases in proportion with the size of the part).
1514 1514 """
1515 1515
1516 1516 def __init__(self, ui, header, fp):
1517 1517 # (payload, file) offsets for chunk starts.
1518 1518 self._chunkindex = []
1519 1519
1520 1520 super(seekableunbundlepart, self).__init__(ui, header, fp)
1521 1521
1522 1522 def _payloadchunks(self, chunknum=0):
1523 1523 '''seek to specified chunk and start yielding data'''
1524 1524 if len(self._chunkindex) == 0:
1525 1525 assert chunknum == 0, b'Must start with chunk 0'
1526 1526 self._chunkindex.append((0, self._tellfp()))
1527 1527 else:
1528 1528 assert chunknum < len(self._chunkindex), (
1529 1529 b'Unknown chunk %d' % chunknum
1530 1530 )
1531 1531 self._seekfp(self._chunkindex[chunknum][1])
1532 1532
1533 1533 pos = self._chunkindex[chunknum][0]
1534 1534
1535 1535 for chunk in decodepayloadchunks(self.ui, self._fp):
1536 1536 chunknum += 1
1537 1537 pos += len(chunk)
1538 1538 if chunknum == len(self._chunkindex):
1539 1539 self._chunkindex.append((pos, self._tellfp()))
1540 1540
1541 1541 yield chunk
1542 1542
1543 1543 def _findchunk(self, pos):
1544 1544 '''for a given payload position, return a chunk number and offset'''
1545 1545 for chunk, (ppos, fpos) in enumerate(self._chunkindex):
1546 1546 if ppos == pos:
1547 1547 return chunk, 0
1548 1548 elif ppos > pos:
1549 1549 return chunk - 1, pos - self._chunkindex[chunk - 1][0]
1550 1550 raise ValueError(b'Unknown chunk')
1551 1551
1552 1552 def tell(self):
1553 1553 return self._pos
1554 1554
1555 1555 def seek(self, offset, whence=os.SEEK_SET):
1556 1556 if whence == os.SEEK_SET:
1557 1557 newpos = offset
1558 1558 elif whence == os.SEEK_CUR:
1559 1559 newpos = self._pos + offset
1560 1560 elif whence == os.SEEK_END:
1561 1561 if not self.consumed:
1562 1562 # Can't use self.consume() here because it advances self._pos.
1563 1563 chunk = self.read(32768)
1564 1564 while chunk:
1565 1565 chunk = self.read(32768)
1566 1566 newpos = self._chunkindex[-1][0] - offset
1567 1567 else:
1568 1568 raise ValueError(b'Unknown whence value: %r' % (whence,))
1569 1569
1570 1570 if newpos > self._chunkindex[-1][0] and not self.consumed:
1571 1571 # Can't use self.consume() here because it advances self._pos.
1572 1572 chunk = self.read(32768)
1573 1573 while chunk:
1574 1574 chunk = self.read(32768)
1575 1575
1576 1576 if not 0 <= newpos <= self._chunkindex[-1][0]:
1577 1577 raise ValueError(b'Offset out of range')
1578 1578
1579 1579 if self._pos != newpos:
1580 1580 chunk, internaloffset = self._findchunk(newpos)
1581 1581 self._payloadstream = util.chunkbuffer(self._payloadchunks(chunk))
1582 1582 adjust = self.read(internaloffset)
1583 1583 if len(adjust) != internaloffset:
1584 1584 raise error.Abort(_(b'Seek failed\n'))
1585 1585 self._pos = newpos
1586 1586
1587 1587 def _seekfp(self, offset, whence=0):
1588 1588 """move the underlying file pointer
1589 1589
1590 1590 This method is meant for internal usage by the bundle2 protocol only.
1591 1591 It directly manipulates the low level stream, including bundle2 level
1592 1592 instructions.
1593 1593
1594 1594 Do not use it to implement higher-level logic or methods."""
1595 1595 if self._seekable:
1596 1596 return self._fp.seek(offset, whence)
1597 1597 else:
1598 1598 raise NotImplementedError(_(b'File pointer is not seekable'))
1599 1599
1600 1600 def _tellfp(self):
1601 1601 """return the file offset, or None if file is not seekable
1602 1602
1603 1603 This method is meant for internal usage by the bundle2 protocol only.
1604 1604 It directly manipulates the low level stream, including bundle2 level
1605 1605 instructions.
1606 1606
1607 1607 Do not use it to implement higher-level logic or methods."""
1608 1608 if self._seekable:
1609 1609 try:
1610 1610 return self._fp.tell()
1611 1611 except IOError as e:
1612 1612 if e.errno == errno.ESPIPE:
1613 1613 self._seekable = False
1614 1614 else:
1615 1615 raise
1616 1616 return None
1617 1617
1618 1618
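The offset lookup performed by `_findchunk` above can also be written with a binary search. This is an alternative sketch, not the method's implementation (which scans the index linearly); `find_chunk` is a hypothetical name and takes the index as an argument.

```python
import bisect


def find_chunk(chunkindex, pos):
    """Map payload offset `pos` to (chunk number, offset inside the chunk).

    `chunkindex` is a list of (payload offset, file offset) pairs, one per
    chunk start, sorted by payload offset.
    """
    starts = [ppos for ppos, _fpos in chunkindex]
    if not starts or not 0 <= pos <= starts[-1]:
        raise ValueError('Unknown chunk')
    # index of the last chunk start that is <= pos
    chunk = bisect.bisect_right(starts, pos) - 1
    return chunk, pos - starts[chunk]
```

Since the index only grows as chunks are decoded, an offset beyond the last recorded start cannot be resolved until more of the payload has been read, which is why `seek` reads ahead before calling `_findchunk`.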
1619 1619 # These are only the static capabilities.
1620 1620 # Check the 'getrepocaps' function for the rest.
1621 1621 capabilities = {
1622 1622 b'HG20': (),
1623 1623 b'bookmarks': (),
1624 1624 b'error': (b'abort', b'unsupportedcontent', b'pushraced', b'pushkey'),
1625 1625 b'listkeys': (),
1626 1626 b'pushkey': (),
1627 1627 b'digests': tuple(sorted(util.DIGESTS.keys())),
1628 1628 b'remote-changegroup': (b'http', b'https'),
1629 1629 b'hgtagsfnodes': (),
1630 1630 b'phases': (b'heads',),
1631 1631 b'stream': (b'v2',),
1632 1632 }
1633 1633
1634 1634
1635 1635 def getrepocaps(repo, allowpushback=False, role=None):
1636 1636 """return the bundle2 capabilities for a given repo
1637 1637
1638 1638 Exists to allow extensions (like evolution) to mutate the capabilities.
1639 1639
1640 1640 The returned value is used for servers advertising their capabilities as
1641 1641 well as clients advertising their capabilities to servers as part of
1642 1642 bundle2 requests. The ``role`` argument specifies which is which.
1643 1643 """
1644 1644 if role not in (b'client', b'server'):
1645 1645 raise error.ProgrammingError(b'role argument must be client or server')
1646 1646
1647 1647 caps = capabilities.copy()
1648 1648 caps[b'changegroup'] = tuple(
1649 1649 sorted(changegroup.supportedincomingversions(repo))
1650 1650 )
1651 1651 if obsolete.isenabled(repo, obsolete.exchangeopt):
1652 1652 supportedformat = tuple(b'V%i' % v for v in obsolete.formats)
1653 1653 caps[b'obsmarkers'] = supportedformat
1654 1654 if allowpushback:
1655 1655 caps[b'pushback'] = ()
1656 1656 cpmode = repo.ui.config(b'server', b'concurrent-push-mode')
1657 1657 if cpmode == b'check-related':
1658 1658 caps[b'checkheads'] = (b'related',)
1659 1659 if b'phases' in repo.ui.configlist(b'devel', b'legacy.exchange'):
1660 1660 caps.pop(b'phases')
1661 1661
1662 1662 # Don't advertise stream clone support in server mode if not configured.
1663 1663 if role == b'server':
1664 1664 streamsupported = repo.ui.configbool(
1665 1665 b'server', b'uncompressed', untrusted=True
1666 1666 )
1667 1667 featuresupported = repo.ui.configbool(b'server', b'bundle2.stream')
1668 1668
1669 1669 if not streamsupported or not featuresupported:
1670 1670 caps.pop(b'stream')
1671 1671 # Else always advertise support on client, because payload support
1672 1672 # should always be advertised.
1673 1673
1674 1674 if repo.ui.configbool(b'experimental', b'stream-v3'):
1675 1675 if b'stream' in caps:
1676 1676 caps[b'stream'] += (b'v3-exp',)
1677 1677
1678 1678 # b'rev-branch-cache' is no longer advertised, but still supported
1679 1679 # for legacy clients.
1680 1680
1681 1681 return caps
1682 1682
1683 1683
1684 1684 def bundle2caps(remote):
1685 1685 """return the bundle capabilities of a peer as dict"""
1686 1686 raw = remote.capable(b'bundle2')
1687 1687 if not raw and raw != b'':
1688 1688 return {}
1689 1689 capsblob = urlreq.unquote(remote.capable(b'bundle2'))
1690 1690 return decodecaps(capsblob)
1691 1691
1692 1692
1693 1693 def obsmarkersversion(caps):
1694 1694 """extract the list of supported obsmarkers versions from a bundle2caps dict"""
1695 1695 obscaps = caps.get(b'obsmarkers', ())
1696 1696 return [int(c[1:]) for c in obscaps if c.startswith(b'V')]
1697 1697
1698 1698
1699 1699 def writenewbundle(
1700 1700 ui,
1701 1701 repo,
1702 1702 source,
1703 1703 filename,
1704 1704 bundletype,
1705 1705 outgoing,
1706 1706 opts,
1707 1707 vfs=None,
1708 1708 compression=None,
1709 1709 compopts=None,
1710 1710 allow_internal=False,
1711 1711 ):
1712 1712 if bundletype.startswith(b'HG10'):
1713 1713 cg = changegroup.makechangegroup(repo, outgoing, b'01', source)
1714 1714 return writebundle(
1715 1715 ui,
1716 1716 cg,
1717 1717 filename,
1718 1718 bundletype,
1719 1719 vfs=vfs,
1720 1720 compression=compression,
1721 1721 compopts=compopts,
1722 1722 )
1723 1723 elif not bundletype.startswith(b'HG20'):
1724 1724 raise error.ProgrammingError(b'unknown bundle type: %s' % bundletype)
1725 1725
1726 1726 # enforce that no internal-phase changesets are bundled
1727 1727 bundled_internal = repo.revs(b"%ln and _internal()", outgoing.ancestorsof)
1728 1728 if bundled_internal and not allow_internal:
1729 1729 count = len(repo.revs(b'%ln and _internal()', outgoing.missing))
1730 1730 msg = "backup bundle would contain %d internal changesets"
1731 1731 msg %= count
1732 1732 raise error.ProgrammingError(msg)
1733 1733
1734 1734 caps = {}
1735 1735 if opts.get(b'obsolescence', False):
1736 1736 caps[b'obsmarkers'] = (b'V1',)
1737 1737 if opts.get(b'streamv2'):
1738 1738 caps[b'stream'] = [b'v2']
1739 1739 elif opts.get(b'streamv3-exp'):
1740 1740 caps[b'stream'] = [b'v3-exp']
1741 1741 bundle = bundle20(ui, caps)
1742 1742 bundle.setcompression(compression, compopts)
1743 1743 _addpartsfromopts(ui, repo, bundle, source, outgoing, opts)
1744 1744 chunkiter = bundle.getchunks()
1745 1745
1746 1746 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
1747 1747
1748 1748
1749 1749 def _addpartsfromopts(ui, repo, bundler, source, outgoing, opts):
1750 1750 # We should eventually reconcile this logic with the one behind
1751 1751 # 'exchange.getbundle2partsgenerator'.
1752 1752 #
1753 1753 # The type of input from 'getbundle' and 'writenewbundle' are a bit
1754 1754 # different right now. So we keep them separated for now for the sake of
1755 1755 # simplicity.
1756 1756
1757 1757 # we might not always want a changegroup in such a bundle, for example in
1758 1758 # stream bundles
1759 1759 if opts.get(b'changegroup', True):
1760 1760 cgversion = opts.get(b'cg.version')
1761 1761 if cgversion is None:
1762 1762 cgversion = changegroup.safeversion(repo)
1763 1763 cg = changegroup.makechangegroup(repo, outgoing, cgversion, source)
1764 1764 part = bundler.newpart(b'changegroup', data=cg.getchunks())
1765 1765 part.addparam(b'version', cg.version)
1766 1766 if b'clcount' in cg.extras:
1767 1767 part.addparam(
1768 1768 b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
1769 1769 )
1770 1770 if opts.get(b'phases'):
1771 1771 target_phase = phases.draft
1772 1772 for head in outgoing.ancestorsof:
1773 1773 target_phase = max(target_phase, repo[head].phase())
1774 1774 if target_phase > phases.draft:
1775 1775 part.addparam(
1776 1776 b'targetphase',
1777 1777 b'%d' % target_phase,
1778 1778 mandatory=False,
1779 1779 )
1780 1780 if repository.REPO_FEATURE_SIDE_DATA in repo.features:
1781 1781 part.addparam(b'exp-sidedata', b'1')
1782 1782
1783 1783 if opts.get(b'streamv2', False):
1784 1784 addpartbundlestream2(bundler, repo, stream=True)
1785 1785
1786 1786 if opts.get(b'streamv3-exp', False):
1787 1787 addpartbundlestream2(bundler, repo, stream=True)
1788 1788
1789 1789 if opts.get(b'tagsfnodescache', True):
1790 1790 addparttagsfnodescache(repo, bundler, outgoing)
1791 1791
1792 1792 if opts.get(b'revbranchcache', True):
1793 1793 addpartrevbranchcache(repo, bundler, outgoing)
1794 1794
1795 1795 if opts.get(b'obsolescence', False):
1796 1796 obsmarkers = repo.obsstore.relevantmarkers(outgoing.missing)
1797 1797 buildobsmarkerspart(
1798 1798 bundler,
1799 1799 obsmarkers,
1800 1800 mandatory=opts.get(b'obsolescence-mandatory', True),
1801 1801 )
1802 1802
1803 1803 if opts.get(b'phases', False):
1804 1804 headsbyphase = phases.subsetphaseheads(repo, outgoing.missing)
1805 1805 phasedata = phases.binaryencode(headsbyphase)
1806 1806 bundler.newpart(b'phase-heads', data=phasedata)
1807 1807
1808 1808
1809 1809 def addparttagsfnodescache(repo, bundler, outgoing):
1810 1810 # we include the tags fnode cache for the bundled changesets
1811 1811 # (as an optional part)
1812 1812 cache = tags.hgtagsfnodescache(repo.unfiltered())
1813 1813 chunks = []
1814 1814
1815 1815 # .hgtags fnodes are only relevant for head changesets. While we could
1816 1816 # transfer values for all known nodes, there will likely be little to
1817 1817 # no benefit.
1818 1818 #
1819 1819 # We don't bother using a generator to produce output data because
1820 1820 # a) we only have 40 bytes per head and even esoteric numbers of heads
1821 1821 # consume little memory (1M heads is 40MB) b) we don't want to send the
1822 1822 # part if we don't have entries and knowing if we have entries requires
1823 1823 # cache lookups.
1824 1824 for node in outgoing.ancestorsof:
1825 1825 # Don't compute missing, as this may slow down serving.
1826 1826 fnode = cache.getfnode(node, computemissing=False)
1827 1827 if fnode:
1828 1828 chunks.extend([node, fnode])
1829 1829
1830 1830 if chunks:
1831 1831 bundler.newpart(b'hgtagsfnodes', data=b''.join(chunks))
1832 1832
1833 1833
1834 1834 def addpartrevbranchcache(repo, bundler, outgoing):
1835 1835 # we include the rev branch cache for the bundled changesets
1836 1836 # (as an optional part)
1837 1837 cache = repo.revbranchcache()
1838 1838 cl = repo.unfiltered().changelog
1839 1839 branchesdata = collections.defaultdict(lambda: (set(), set()))
1840 1840 for node in outgoing.missing:
1841 1841 branch, close = cache.branchinfo(cl.rev(node))
1842 1842 branchesdata[branch][close].add(node)
1843 1843
1844 1844 def generate():
1845 1845 for branch, (nodes, closed) in sorted(branchesdata.items()):
1846 1846 utf8branch = encoding.fromlocal(branch)
1847 1847 yield rbcstruct.pack(len(utf8branch), len(nodes), len(closed))
1848 1848 yield utf8branch
1849 1849 for n in sorted(nodes):
1850 1850 yield n
1851 1851 for n in sorted(closed):
1852 1852 yield n
1853 1853
1854 1854 bundler.newpart(b'cache:rev-branch-cache', data=generate(), mandatory=False)
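The record layout produced by `generate()` above can be sketched in isolation. This is a minimal illustration, assuming 20-byte nodes and a single branch per record; `encode_branch` and `decode_branch` are hypothetical helper names, not part of the module.

```python
import struct

# Same layout as rbcstruct: branch-name length, open-node count,
# closed-node count, all unsigned 32-bit big-endian.
RBC_HEADER = struct.Struct('>III')


def encode_branch(branch, nodes, closed):
    """Serialize one branch record (hypothetical helper)."""
    utf8branch = branch.encode('utf-8')
    out = [RBC_HEADER.pack(len(utf8branch), len(nodes), len(closed))]
    out.append(utf8branch)
    out.extend(sorted(nodes))
    out.extend(sorted(closed))
    return b''.join(out)


def decode_branch(data, nodelen=20):
    """Parse one branch record back (hypothetical helper)."""
    namelen, nopen, nclosed = RBC_HEADER.unpack_from(data, 0)
    offset = RBC_HEADER.size
    branch = data[offset:offset + namelen].decode('utf-8')
    offset += namelen
    nodes = [
        data[offset + i * nodelen:offset + (i + 1) * nodelen]
        for i in range(nopen + nclosed)
    ]
    return branch, nodes[:nopen], nodes[nopen:]


record = encode_branch(
    'default', {b'\x02' * 20, b'\x01' * 20}, {b'\x03' * 20}
)
decoded = decode_branch(record)
```

The header is 12 bytes, so the record above is 12 + 7 + 3 * 20 bytes long.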
1855 1855
1856 1856
1857 1857 def _formatrequirementsspec(requirements):
1858 1858 requirements = [req for req in requirements if req != b"shared"]
1859 1859 return urlreq.quote(b','.join(sorted(requirements)))
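A standalone sketch of the requirements encoding, assuming the standard library's `urllib.parse` behaves like the `urlreq` wrapper for this input (an assumption; the wrapper itself is not shown here). `format_requirements` and `parse_requirements` are hypothetical names.

```python
from urllib.parse import quote_from_bytes, unquote_to_bytes


def format_requirements(requirements):
    """Drop b'shared', sort, comma-join, then URL-quote (sketch)."""
    kept = sorted(req for req in requirements if req != b'shared')
    return quote_from_bytes(b','.join(kept)).encode('ascii')


def parse_requirements(spec):
    """Inverse operation, mirroring what the stream part handlers do."""
    raw = unquote_to_bytes(spec)
    return raw.split(b',') if raw else []


spec = format_requirements({b'store', b'shared', b'revlogv1'})
roundtrip = parse_requirements(spec)
```

The comma separator is itself quoted (`%2C`), so the value is safe to carry as a part parameter.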
1860 1860
1861 1861
1862 1862 def _formatrequirementsparams(requirements):
1863 1863 requirements = _formatrequirementsspec(requirements)
1864 1864 params = b"%s%s" % (urlreq.quote(b"requirements="), requirements)
1865 1865 return params
1866 1866
1867 1867
1868 1868 def format_remote_wanted_sidedata(repo):
1869 1869 """Formats a repo's wanted sidedata categories into a bytestring for
1870 1870 capabilities exchange."""
1871 1871 wanted = b""
1872 1872 if repo._wanted_sidedata:
1873 1873 wanted = b','.join(
1874 1874 pycompat.bytestr(c) for c in sorted(repo._wanted_sidedata)
1875 1875 )
1876 1876 return wanted
1877 1877
1878 1878
1879 1879 def read_remote_wanted_sidedata(remote):
1880 1880 sidedata_categories = remote.capable(b'exp-wanted-sidedata')
1881 1881 return read_wanted_sidedata(sidedata_categories)
1882 1882
1883 1883
1884 1884 def read_wanted_sidedata(formatted):
1885 1885 if formatted:
1886 1886 return set(formatted.split(b','))
1887 1887 return set()
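The two sidedata helpers above form a round trip. A minimal sketch, assuming the categories are already bytes (the module converts them with `pycompat.bytestr`); `format_wanted` and `read_wanted` are hypothetical names.

```python
def format_wanted(categories):
    """Join the wanted categories into a comma-separated bytestring."""
    if not categories:
        return b''
    return b','.join(sorted(categories))


def read_wanted(formatted):
    """Parse the bytestring back into a set; empty or missing gives set()."""
    if formatted:
        return set(formatted.split(b','))
    return set()


wire = format_wanted({b'copies', b'changed-files'})
parsed = read_wanted(wire)
```

A falsy capability value (for instance when the remote does not advertise the capability) yields an empty set instead of raising.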
1888 1888
1889 1889
1890 1890 def addpartbundlestream2(bundler, repo, **kwargs):
1891 1891 if not kwargs.get('stream', False):
1892 1892 return
1893 1893
1894 1894 if not streamclone.allowservergeneration(repo):
1895 1895 msg = _(b'stream data requested but server does not allow this feature')
1896 1896 hint = _(b'the client seems buggy')
1897 1897 raise error.Abort(msg, hint=hint)
1898 1898 if not (b'stream' in bundler.capabilities):
1899 1899 msg = _(
1900 1900 b'stream data requested but supported streaming clone versions were not specified'
1901 1901 )
1902 1902 hint = _(b'the client seems buggy')
1903 1903 raise error.Abort(msg, hint=hint)
1904 1904 client_supported = set(bundler.capabilities[b'stream'])
1905 1905 server_supported = set(getrepocaps(repo, role=b'client').get(b'stream', []))
1906 1906 common_supported = client_supported & server_supported
1907 1907 if not common_supported:
1908 1908 msg = _(b'no common supported version with the client: %s; %s')
1909 1909 str_server = b','.join(sorted(server_supported))
1910 1910 str_client = b','.join(sorted(client_supported))
1911 1911 msg %= (str_server, str_client)
1912 1912 raise error.Abort(msg)
1913 1913 version = max(common_supported)
1914 1914
1915 1915 # Stream clones don't compress well. And compression undermines a
1916 1916 # goal of stream clones, which is to be fast. Communicate the desire
1917 1917 # to avoid compression to consumers of the bundle.
1918 1918 bundler.prefercompressed = False
1919 1919
1920 1920 # get the includes and excludes
1921 1921 includepats = kwargs.get('includepats')
1922 1922 excludepats = kwargs.get('excludepats')
1923 1923
1924 1924 narrowstream = repo.ui.configbool(
1925 1925 b'experimental', b'server.stream-narrow-clones'
1926 1926 )
1927 1927
1928 1928 if (includepats or excludepats) and not narrowstream:
1929 1929 raise error.Abort(_(b'server does not support narrow stream clones'))
1930 1930
1931 1931 includeobsmarkers = False
1932 1932 if repo.obsstore:
1933 1933 remoteversions = obsmarkersversion(bundler.capabilities)
1934 1934 if not remoteversions:
1935 1935 raise error.Abort(
1936 1936 _(
1937 1937 b'server has obsolescence markers, but client '
1938 1938 b'cannot receive them via stream clone'
1939 1939 )
1940 1940 )
1941 1941 elif repo.obsstore._version in remoteversions:
1942 1942 includeobsmarkers = True
1943 1943
1944 1944 if version == b"v2":
1945 1945 filecount, bytecount, it = streamclone.generatev2(
1946 1946 repo, includepats, excludepats, includeobsmarkers
1947 1947 )
1948 1948 requirements = streamclone.streamed_requirements(repo)
1949 1949 requirements = _formatrequirementsspec(requirements)
1950 1950 part = bundler.newpart(b'stream2', data=it)
1951 1951 part.addparam(b'bytecount', b'%d' % bytecount, mandatory=True)
1952 1952 part.addparam(b'filecount', b'%d' % filecount, mandatory=True)
1953 1953 part.addparam(b'requirements', requirements, mandatory=True)
1954 1954 elif version == b"v3-exp":
1955 1955 it = streamclone.generatev3(
1956 1956 repo, includepats, excludepats, includeobsmarkers
1957 1957 )
1958 1958 requirements = streamclone.streamed_requirements(repo)
1959 1959 requirements = _formatrequirementsspec(requirements)
1960 1960 part = bundler.newpart(b'stream3-exp', data=it)
1961 1961 part.addparam(b'requirements', requirements, mandatory=True)
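The version negotiation at the top of `addpartbundlestream2` reduces to a set intersection followed by `max`. A minimal sketch, with `pick_stream_version` as a hypothetical name and `ValueError` standing in for `error.Abort`:

```python
def pick_stream_version(client_supported, server_supported):
    """Return the highest version both sides support (sketch)."""
    common = set(client_supported) & set(server_supported)
    if not common:
        raise ValueError(
            'no common supported version: %r; %r'
            % (sorted(server_supported), sorted(client_supported))
        )
    # max() on bytes compares lexicographically.
    return max(common)


both = pick_stream_version([b'v2', b'v3-exp'], [b'v2', b'v3-exp'])
only_v2 = pick_stream_version([b'v2', b'v3-exp'], [b'v2'])
```

Because the comparison is lexicographic, it orders `b'v2'` before `b'v3-exp'` as intended, but it would also order a hypothetical `b'v10'` before `b'v2'`.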
1962 1962
1963 1963
1964 1964 def buildobsmarkerspart(bundler, markers, mandatory=True):
1965 1965 """add an obsmarker part to the bundler with <markers>
1966 1966
1967 1967 No part is created if markers is empty.
1968 1968 Raises ValueError if the bundler doesn't support any known obsmarker format.
1969 1969 """
1970 1970 if not markers:
1971 1971 return None
1972 1972
1973 1973 remoteversions = obsmarkersversion(bundler.capabilities)
1974 1974 version = obsolete.commonversion(remoteversions)
1975 1975 if version is None:
1976 1976 raise ValueError(b'bundler does not support common obsmarker format')
1977 1977 stream = obsolete.encodemarkers(markers, True, version=version)
1978 1978 return bundler.newpart(b'obsmarkers', data=stream, mandatory=mandatory)
1979 1979
1980 1980
1981 1981 def writebundle(
1982 1982 ui, cg, filename, bundletype, vfs=None, compression=None, compopts=None
1983 1983 ):
1984 1984 """Write a bundle file and return its filename.
1985 1985
1986 1986 Existing files will not be overwritten.
1987 1987 If no filename is specified, a temporary file is created.
1988 1988 bz2 compression can be turned off.
1989 1989 The bundle file will be deleted in case of errors.
1990 1990 """
1991 1991
1992 1992 if bundletype == b"HG20":
1993 1993 bundle = bundle20(ui)
1994 1994 bundle.setcompression(compression, compopts)
1995 1995 part = bundle.newpart(b'changegroup', data=cg.getchunks())
1996 1996 part.addparam(b'version', cg.version)
1997 1997 if b'clcount' in cg.extras:
1998 1998 part.addparam(
1999 1999 b'nbchanges', b'%d' % cg.extras[b'clcount'], mandatory=False
2000 2000 )
2001 2001 chunkiter = bundle.getchunks()
2002 2002 else:
2003 2003 # compression argument is only for the bundle2 case
2004 2004 assert compression is None
2005 2005 if cg.version != b'01':
2006 2006 raise error.Abort(
2007 2007 _(b'old bundle types only supports v1 changegroups')
2008 2008 )
2009 2009
2010 2010 # HG20 is the case without 2 values to unpack, but is handled above.
2011 2011 # pytype: disable=bad-unpacking
2012 2012 header, comp = bundletypes[bundletype]
2013 2013 # pytype: enable=bad-unpacking
2014 2014
2015 2015 if comp not in util.compengines.supportedbundletypes:
2016 2016 raise error.Abort(_(b'unknown stream compression type: %s') % comp)
2017 2017 compengine = util.compengines.forbundletype(comp)
2018 2018
2019 2019 def chunkiter():
2020 2020 yield header
2021 2021 for chunk in compengine.compressstream(cg.getchunks(), compopts):
2022 2022 yield chunk
2023 2023
2024 2024 chunkiter = chunkiter()
2025 2025
2026 2026 # parse the changegroup data, otherwise we will block
2027 2027 # in case of sshrepo because we don't know the end of the stream
2028 2028 return changegroup.writechunks(ui, chunkiter, filename, vfs=vfs)
2029 2029
2030 2030
2031 2031 def combinechangegroupresults(op):
2032 2032 """logic to combine 0 or more addchangegroup results into one"""
2033 2033 results = [r.get(b'return', 0) for r in op.records[b'changegroup']]
2034 2034 changedheads = 0
2035 2035 result = 1
2036 2036 for ret in results:
2037 2037 # If any changegroup result is 0, return 0
2038 2038 if ret == 0:
2039 2039 result = 0
2040 2040 break
2041 2041 if ret < -1:
2042 2042 changedheads += ret + 1
2043 2043 elif ret > 1:
2044 2044 changedheads += ret - 1
2045 2045 if changedheads > 0:
2046 2046 result = 1 + changedheads
2047 2047 elif changedheads < 0:
2048 2048 result = -1 + changedheads
2049 2049 return result
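The combination rule is easier to follow on plain integers. The sketch below copies the loop above into a standalone function (`combine_results` is a hypothetical name) and assumes the usual changegroup return convention: 0 for nothing changed, 1 for an unchanged head count, 1+n for n added heads, -1-n for n removed heads.

```python
def combine_results(results):
    """Combine per-changegroup return codes into one (sketch)."""
    changedheads = 0
    result = 1
    for ret in results:
        if ret == 0:
            result = 0
            break
        if ret < -1:
            changedheads += ret + 1
        elif ret > 1:
            changedheads += ret - 1
    if changedheads > 0:
        result = 1 + changedheads
    elif changedheads < 0:
        result = -1 + changedheads
    return result


no_parts = combine_results([])          # nothing recorded
added = combine_results([3, 2])         # 2 + 1 heads added
removed = combine_results([-2, -3])     # 1 + 2 heads removed
cancelled = combine_results([3, -3])    # +2 and -2 cancel out
```

Head deltas from separate changegroups are summed before being re-encoded, so additions and removals can cancel.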
2050 2050
2051 2051
2052 2052 @parthandler(
2053 2053 b'changegroup',
2054 2054 (
2055 2055 b'version',
2056 2056 b'nbchanges',
2057 2057 b'exp-sidedata',
2058 2058 b'exp-wanted-sidedata',
2059 2059 b'treemanifest',
2060 2060 b'targetphase',
2061 2061 ),
2062 2062 )
2063 2063 def handlechangegroup(op, inpart):
2064 2064 """apply a changegroup part on the repo"""
2065 2065 from . import localrepo
2066 2066
2067 2067 tr = op.gettransaction()
2068 2068 unpackerversion = inpart.params.get(b'version', b'01')
2069 2069 # We should raise an appropriate exception here
2070 2070 cg = changegroup.getunbundler(unpackerversion, inpart, None)
2071 2071 # the source and url passed here are overwritten by the one contained in
2072 2072 # the transaction.hookargs argument. So 'bundle2' is a placeholder
2073 2073 nbchangesets = None
2074 2074 if b'nbchanges' in inpart.params:
2075 2075 nbchangesets = int(inpart.params.get(b'nbchanges'))
2076 2076 if b'treemanifest' in inpart.params and not scmutil.istreemanifest(op.repo):
2077 2077 if len(op.repo.changelog) != 0:
2078 2078 raise error.Abort(
2079 2079 _(
2080 2080 b"bundle contains tree manifests, but local repo is "
2081 2081 b"non-empty and does not use tree manifests"
2082 2082 )
2083 2083 )
2084 2084 op.repo.requirements.add(requirements.TREEMANIFEST_REQUIREMENT)
2085 2085 op.repo.svfs.options = localrepo.resolvestorevfsoptions(
2086 2086 op.repo.ui, op.repo.requirements, op.repo.features
2087 2087 )
2088 2088 scmutil.writereporequirements(op.repo)
2089 2089
2090 2090 extrakwargs = {}
2091 2091 targetphase = inpart.params.get(b'targetphase')
2092 2092 if targetphase is not None:
2093 2093 extrakwargs['targetphase'] = int(targetphase)
2094 2094
2095 2095 remote_sidedata = inpart.params.get(b'exp-wanted-sidedata')
2096 2096 extrakwargs['sidedata_categories'] = read_wanted_sidedata(remote_sidedata)
2097 2097
2098 2098 ret = _processchangegroup(
2099 2099 op,
2100 2100 cg,
2101 2101 tr,
2102 2102 op.source,
2103 2103 b'bundle2',
2104 2104 expectedtotal=nbchangesets,
2105 2105 **extrakwargs
2106 2106 )
2107 2107 if op.reply is not None:
2108 2108 # This is definitely not the final form of this
2109 2109 # return. But one needs to start somewhere.
2110 2110 part = op.reply.newpart(b'reply:changegroup', mandatory=False)
2111 2111 part.addparam(
2112 2112 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2113 2113 )
2114 2114 part.addparam(b'return', b'%i' % ret, mandatory=False)
2115 2115 assert not inpart.read()
2116 2116
2117 2117
2118 2118 _remotechangegroupparams = tuple(
2119 2119 [b'url', b'size', b'digests']
2120 2120 + [b'digest:%s' % k for k in util.DIGESTS.keys()]
2121 2121 )
2122 2122
2123 2123
2124 2124 @parthandler(b'remote-changegroup', _remotechangegroupparams)
2125 2125 def handleremotechangegroup(op, inpart):
2126 2126 """apply a bundle10 on the repo, given an url and validation information
2127 2127
2128 2128 All the information about the remote bundle to import is given as
2129 2129 parameters. The parameters include:
2130 2130 - url: the url to the bundle10.
2131 2131 - size: the bundle10 file size. It is used to validate that what was
2132 2132 retrieved by the client matches the server's knowledge about the bundle.
2133 2133 - digests: a space separated list of the digest types provided as
2134 2134 parameters.
2135 2135 - digest:<digest-type>: the hexadecimal representation of the digest with
2136 2136 that name. Like the size, it is used to validate that what was retrieved by
2137 2137 the client matches what the server knows about the bundle.
2138 2138
2139 2139 When multiple digest types are given, all of them are checked.
2140 2140 """
2141 2141 try:
2142 2142 raw_url = inpart.params[b'url']
2143 2143 except KeyError:
2144 2144 raise error.Abort(_(b'remote-changegroup: missing "%s" param') % b'url')
2145 2145 parsed_url = urlutil.url(raw_url)
2146 2146 if parsed_url.scheme not in capabilities[b'remote-changegroup']:
2147 2147 raise error.Abort(
2148 2148 _(b'remote-changegroup does not support %s urls')
2149 2149 % parsed_url.scheme
2150 2150 )
2151 2151
2152 2152 try:
2153 2153 size = int(inpart.params[b'size'])
2154 2154 except ValueError:
2155 2155 raise error.Abort(
2156 2156 _(b'remote-changegroup: invalid value for param "%s"') % b'size'
2157 2157 )
2158 2158 except KeyError:
2159 2159 raise error.Abort(
2160 2160 _(b'remote-changegroup: missing "%s" param') % b'size'
2161 2161 )
2162 2162
2163 2163 digests = {}
2164 2164 for typ in inpart.params.get(b'digests', b'').split():
2165 2165 param = b'digest:%s' % typ
2166 2166 try:
2167 2167 value = inpart.params[param]
2168 2168 except KeyError:
2169 2169 raise error.Abort(
2170 2170 _(b'remote-changegroup: missing "%s" param') % param
2171 2171 )
2172 2172 digests[typ] = value
2173 2173
2174 2174 real_part = util.digestchecker(url.open(op.ui, raw_url), size, digests)
2175 2175
2176 2176 tr = op.gettransaction()
2177 2177 from . import exchange
2178 2178
2179 2179 cg = exchange.readbundle(op.repo.ui, real_part, raw_url)
2180 2180 if not isinstance(cg, changegroup.cg1unpacker):
2181 2181 raise error.Abort(
2182 2182 _(b'%s: not a bundle version 1.0') % urlutil.hidepassword(raw_url)
2183 2183 )
2184 2184 ret = _processchangegroup(op, cg, tr, op.source, b'bundle2')
2185 2185 if op.reply is not None:
2186 2186 # This is definitely not the final form of this
2187 2187 # return. But one needs to start somewhere.
2188 2188 part = op.reply.newpart(b'reply:changegroup')
2189 2189 part.addparam(
2190 2190 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2191 2191 )
2192 2192 part.addparam(b'return', b'%i' % ret, mandatory=False)
2193 2193 try:
2194 2194 real_part.validate()
2195 2195 except error.Abort as e:
2196 2196 raise error.Abort(
2197 2197 _(b'bundle at %s is corrupted:\n%s')
2198 2198 % (urlutil.hidepassword(raw_url), e.message)
2199 2199 )
2200 2200 assert not inpart.read()
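The size and digest validation described in the docstring can be sketched with `hashlib` alone. This is an illustration of the idea, not the `util.digestchecker` implementation; `validate_payload` is a hypothetical name, and it checks a fully buffered payload rather than a stream.

```python
import hashlib


def validate_payload(data, size, digests):
    """Check the retrieved bytes against the announced size and digests.

    `digests` maps a digest type name (e.g. 'sha1') to its expected
    hexadecimal value. Every listed digest is checked.
    """
    if len(data) != size:
        raise ValueError(
            'size mismatch: expected %d, got %d' % (size, len(data))
        )
    for typ, expected in digests.items():
        actual = hashlib.new(typ, data).hexdigest()
        if actual != expected:
            raise ValueError(
                '%s mismatch: expected %s, got %s' % (typ, expected, actual)
            )


payload = b'bundle bytes'
announced = {
    'sha1': hashlib.sha1(payload).hexdigest(),
    'md5': hashlib.md5(payload).hexdigest(),
}
validate_payload(payload, len(payload), announced)  # passes silently
```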
2201 2201
2202 2202
2203 2203 @parthandler(b'reply:changegroup', (b'return', b'in-reply-to'))
2204 2204 def handlereplychangegroup(op, inpart):
2205 2205 ret = int(inpart.params[b'return'])
2206 2206 replyto = int(inpart.params[b'in-reply-to'])
2207 2207 op.records.add(b'changegroup', {b'return': ret}, replyto)
2208 2208
2209 2209
2210 2210 @parthandler(b'check:bookmarks')
2211 2211 def handlecheckbookmarks(op, inpart):
2212 2212 """check location of bookmarks
2213 2213
2214 2214 This part is to be used to detect push races regarding bookmarks. It
2215 2215 contains binary encoded (bookmark, node) tuples. If the local state does
2216 2216 not match the one in the part, a PushRaced exception is raised.
2217 2217 """
2218 2218 bookdata = bookmarks.binarydecode(op.repo, inpart)
2219 2219
2220 2220 msgstandard = (
2221 2221 b'remote repository changed while pushing - please try again '
2222 2222 b'(bookmark "%s" move from %s to %s)'
2223 2223 )
2224 2224 msgmissing = (
2225 2225 b'remote repository changed while pushing - please try again '
2226 2226 b'(bookmark "%s" is missing, expected %s)'
2227 2227 )
2228 2228 msgexist = (
2229 2229 b'remote repository changed while pushing - please try again '
2230 2230 b'(bookmark "%s" set on %s, expected missing)'
2231 2231 )
2232 2232 for book, node in bookdata:
2233 2233 currentnode = op.repo._bookmarks.get(book)
2234 2234 if currentnode != node:
2235 2235 if node is None:
2236 2236 finalmsg = msgexist % (book, short(currentnode))
2237 2237 elif currentnode is None:
2238 2238 finalmsg = msgmissing % (book, short(node))
2239 2239 else:
2240 2240 finalmsg = msgstandard % (
2241 2241 book,
2242 2242 short(node),
2243 2243 short(currentnode),
2244 2244 )
2245 2245 raise error.PushRaced(finalmsg)
2246 2246
2247 2247
2248 2248 @parthandler(b'check:heads')
2249 2249 def handlecheckheads(op, inpart):
2250 2250 """check that the heads of the repo did not change
2251 2251
2252 2252 This is used to detect a push race when using unbundle.
2253 2253 This replaces the "heads" argument of unbundle."""
2254 2254 h = inpart.read(20)
2255 2255 heads = []
2256 2256 while len(h) == 20:
2257 2257 heads.append(h)
2258 2258 h = inpart.read(20)
2259 2259 assert not h
2260 2260 # Trigger a transaction so that we are guaranteed to have the lock now.
2261 2261 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2262 2262 op.gettransaction()
2263 2263 if sorted(heads) != sorted(op.repo.heads()):
2264 2264 raise error.PushRaced(
2265 2265 b'remote repository changed while pushing - please try again'
2266 2266 )
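The payload of a 'check:heads' part is read as a plain concatenation of 20-byte nodes. A minimal sketch of that read loop and the order-insensitive comparison, using `io.BytesIO` in place of the part object; `read_nodes` and `heads_unchanged` are hypothetical names.

```python
import io


def read_nodes(stream, nodelen=20):
    """Read fixed-size nodes until the stream is exhausted."""
    nodes = []
    chunk = stream.read(nodelen)
    while len(chunk) == nodelen:
        nodes.append(chunk)
        chunk = stream.read(nodelen)
    if chunk:
        # The handler asserts here; a sketch can raise instead.
        raise ValueError('trailing partial node')
    return nodes


def heads_unchanged(expected, current):
    """Compare two head lists regardless of order."""
    return sorted(expected) == sorted(current)


a, b = b'\x0a' * 20, b'\x0b' * 20
heads = read_nodes(io.BytesIO(b + a))
same = heads_unchanged(heads, [a, b])
raced = not heads_unchanged(heads, [a])
```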
2267 2267
2268 2268
2269 2269 @parthandler(b'check:updated-heads')
2270 2270 def handlecheckupdatedheads(op, inpart):
2271 2271 """check for race on the heads touched by a push
2272 2272
2273 2273 This is similar to 'check:heads' but focuses on the heads actually updated
2274 2274 during the push. If other activity happens on unrelated heads, it is
2275 2275 ignored.
2276 2276
2277 2277 This allows servers with high traffic to avoid push contention as long as
2278 2278 unrelated parts of the graph are involved."""
2279 2279 h = inpart.read(20)
2280 2280 heads = []
2281 2281 while len(h) == 20:
2282 2282 heads.append(h)
2283 2283 h = inpart.read(20)
2284 2284 assert not h
2285 2285 # trigger a transaction so that we are guaranteed to have the lock now.
2286 2286 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2287 2287 op.gettransaction()
2288 2288
2289 2289 currentheads = set()
2290 2290 for ls in op.repo.branchmap().iterheads():
2291 2291 currentheads.update(ls)
2292 2292
2293 2293 for h in heads:
2294 2294 if h not in currentheads:
2295 2295 raise error.PushRaced(
2296 2296 b'remote repository changed while pushing - '
2297 2297 b'please try again'
2298 2298 )
2299 2299
2300 2300
2301 2301 @parthandler(b'check:phases')
2302 2302 def handlecheckphases(op, inpart):
2303 2303 """check that phase boundaries of the repository did not change
2304 2304
2305 2305 This is used to detect a push race.
2306 2306 """
2307 2307 phasetonodes = phases.binarydecode(inpart)
2308 2308 unfi = op.repo.unfiltered()
2309 2309 cl = unfi.changelog
2310 2310 phasecache = unfi._phasecache
2311 2311 msg = (
2312 2312 b'remote repository changed while pushing - please try again '
2313 2313 b'(%s is %s expected %s)'
2314 2314 )
2315 2315 for expectedphase, nodes in phasetonodes.items():
2316 2316 for n in nodes:
2317 2317 actualphase = phasecache.phase(unfi, cl.rev(n))
2318 2318 if actualphase != expectedphase:
2319 2319 finalmsg = msg % (
2320 2320 short(n),
2321 2321 phases.phasenames[actualphase],
2322 2322 phases.phasenames[expectedphase],
2323 2323 )
2324 2324 raise error.PushRaced(finalmsg)
2325 2325
2326 2326
2327 2327 @parthandler(b'output')
2328 2328 def handleoutput(op, inpart):
2329 2329 """forward output captured on the server to the client"""
2330 2330 for line in inpart.read().splitlines():
2331 2331 op.ui.status(_(b'remote: %s\n') % line)
2332 2332
2333 2333
2334 2334 @parthandler(b'replycaps')
2335 2335 def handlereplycaps(op, inpart):
2336 2336 """Notify that a reply bundle should be created
2337 2337
2338 2338 The payload contains the capabilities information for the reply"""
2339 2339 caps = decodecaps(inpart.read())
2340 2340 if op.reply is None:
2341 2341 op.reply = bundle20(op.ui, caps)
2342 2342
2343 2343
2344 2344 class AbortFromPart(error.Abort):
2345 2345 """Sub-class of Abort that denotes an error from a bundle2 part."""
2346 2346
2347 2347
2348 2348 @parthandler(b'error:abort', (b'message', b'hint'))
2349 2349 def handleerrorabort(op, inpart):
2350 2350 """Used to transmit abort error over the wire"""
2351 2351 raise AbortFromPart(
2352 2352 inpart.params[b'message'], hint=inpart.params.get(b'hint')
2353 2353 )
2354 2354
2355 2355
2356 2356 @parthandler(
2357 2357 b'error:pushkey',
2358 2358 (b'namespace', b'key', b'new', b'old', b'ret', b'in-reply-to'),
2359 2359 )
2360 2360 def handleerrorpushkey(op, inpart):
2361 2361 """Used to transmit failure of a mandatory pushkey over the wire"""
2362 2362 kwargs = {}
2363 2363 for name in (b'namespace', b'key', b'new', b'old', b'ret'):
2364 2364 value = inpart.params.get(name)
2365 2365 if value is not None:
2366 2366 kwargs[name] = value
2367 2367 raise error.PushkeyFailed(
2368 2368 inpart.params[b'in-reply-to'], **pycompat.strkwargs(kwargs)
2369 2369 )
2370 2370
2371 2371
2372 2372 @parthandler(b'error:unsupportedcontent', (b'parttype', b'params'))
2373 2373 def handleerrorunsupportedcontent(op, inpart):
2374 2374 """Used to transmit unknown content error over the wire"""
2375 2375 kwargs = {}
2376 2376 parttype = inpart.params.get(b'parttype')
2377 2377 if parttype is not None:
2378 2378 kwargs[b'parttype'] = parttype
2379 2379 params = inpart.params.get(b'params')
2380 2380 if params is not None:
2381 2381 kwargs[b'params'] = params.split(b'\0')
2382 2382
2383 2383 raise error.BundleUnknownFeatureError(**pycompat.strkwargs(kwargs))
2384 2384
2385 2385
2386 2386 @parthandler(b'error:pushraced', (b'message',))
2387 2387 def handleerrorpushraced(op, inpart):
2388 2388 """Used to transmit push race error over the wire"""
2389 2389 raise error.ResponseError(_(b'push failed:'), inpart.params[b'message'])
2390 2390
2391 2391
2392 2392 @parthandler(b'listkeys', (b'namespace',))
2393 2393 def handlelistkeys(op, inpart):
2394 2394 """retrieve pushkey namespace content stored in a bundle2"""
2395 2395 namespace = inpart.params[b'namespace']
2396 2396 r = pushkey.decodekeys(inpart.read())
2397 2397 op.records.add(b'listkeys', (namespace, r))
2398 2398
2399 2399
2400 2400 @parthandler(b'pushkey', (b'namespace', b'key', b'old', b'new'))
2401 2401 def handlepushkey(op, inpart):
2402 2402 """process a pushkey request"""
2403 2403 dec = pushkey.decode
2404 2404 namespace = dec(inpart.params[b'namespace'])
2405 2405 key = dec(inpart.params[b'key'])
2406 2406 old = dec(inpart.params[b'old'])
2407 2407 new = dec(inpart.params[b'new'])
2408 2408 # Grab the transaction to ensure that we have the lock before performing the
2409 2409 # pushkey.
2410 2410 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2411 2411 op.gettransaction()
2412 2412 ret = op.repo.pushkey(namespace, key, old, new)
2413 2413 record = {b'namespace': namespace, b'key': key, b'old': old, b'new': new}
2414 2414 op.records.add(b'pushkey', record)
2415 2415 if op.reply is not None:
2416 2416 rpart = op.reply.newpart(b'reply:pushkey')
2417 2417 rpart.addparam(
2418 2418 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2419 2419 )
2420 2420 rpart.addparam(b'return', b'%i' % ret, mandatory=False)
2421 2421 if inpart.mandatory and not ret:
2422 2422 kwargs = {}
2423 2423 for key in (b'namespace', b'key', b'new', b'old', b'ret'):
2424 2424 if key in inpart.params:
2425 2425 kwargs[key] = inpart.params[key]
2426 2426 raise error.PushkeyFailed(
2427 2427 partid=b'%d' % inpart.id, **pycompat.strkwargs(kwargs)
2428 2428 )
2429 2429
2430 2430
2431 2431 @parthandler(b'bookmarks')
2432 2432 def handlebookmark(op, inpart):
2433 2433 """transmit bookmark information
2434 2434
2435 2435 The part contains binary encoded bookmark information.
2436 2436
2437 2437 The exact behavior of this part can be controlled by the 'bookmarks' mode
2438 2438 on the bundle operation.
2439 2439
2440 2440 When mode is 'apply' (the default) the bookmark information is applied as
2441 2441 is to the unbundling repository. Make sure a 'check:bookmarks' part is
2442 2442 issued earlier to check for push races in such update. This behavior is
2443 2443 suitable for pushing.
2444 2444
2445 2445 When mode is 'records', the information is recorded into the 'bookmarks'
2446 2446 records of the bundle operation. This behavior is suitable for pulling.
2447 2447 """
2448 2448 changes = bookmarks.binarydecode(op.repo, inpart)
2449 2449
2450 2450 pushkeycompat = op.repo.ui.configbool(
2451 2451 b'server', b'bookmarks-pushkey-compat'
2452 2452 )
2453 2453 bookmarksmode = op.modes.get(b'bookmarks', b'apply')
2454 2454
2455 2455 if bookmarksmode == b'apply':
2456 2456 tr = op.gettransaction()
2457 2457 bookstore = op.repo._bookmarks
2458 2458 if pushkeycompat:
2459 2459 allhooks = []
2460 2460 for book, node in changes:
2461 2461 hookargs = tr.hookargs.copy()
2462 2462 hookargs[b'pushkeycompat'] = b'1'
2463 2463 hookargs[b'namespace'] = b'bookmarks'
2464 2464 hookargs[b'key'] = book
2465 2465 hookargs[b'old'] = hex(bookstore.get(book, b''))
2466 2466 hookargs[b'new'] = hex(node if node is not None else b'')
2467 2467 allhooks.append(hookargs)
2468 2468
2469 2469 for hookargs in allhooks:
2470 2470 op.repo.hook(
2471 2471 b'prepushkey', throw=True, **pycompat.strkwargs(hookargs)
2472 2472 )
2473 2473
2474 2474 for book, node in changes:
2475 2475 if bookmarks.isdivergent(book):
2476 2476 msg = _(b'cannot accept divergent bookmark %s!') % book
2477 2477 raise error.Abort(msg)
2478 2478
2479 2479 bookstore.applychanges(op.repo, op.gettransaction(), changes)
2480 2480
2481 2481 if pushkeycompat:
2482 2482
2483 2483 def runhook(unused_success):
2484 2484 for hookargs in allhooks:
2485 2485 op.repo.hook(b'pushkey', **pycompat.strkwargs(hookargs))
2486 2486
2487 2487 op.repo._afterlock(runhook)
2488 2488
2489 2489 elif bookmarksmode == b'records':
2490 2490 for book, node in changes:
2491 2491 record = {b'bookmark': book, b'node': node}
2492 2492 op.records.add(b'bookmarks', record)
2493 2493 else:
2494 2494 raise error.ProgrammingError(
2495 2495 b'unknown bookmark mode: %s' % bookmarksmode
2496 2496 )
2497 2497
2498 2498
2499 2499 @parthandler(b'phase-heads')
2500 2500 def handlephases(op, inpart):
2501 2501 """apply phases from bundle part to repo"""
2502 2502 headsbyphase = phases.binarydecode(inpart)
2503 2503 phases.updatephases(op.repo.unfiltered(), op.gettransaction, headsbyphase)
2504 2504
2505 2505
2506 2506 @parthandler(b'reply:pushkey', (b'return', b'in-reply-to'))
2507 2507 def handlepushkeyreply(op, inpart):
2508 2508 """retrieve the result of a pushkey request"""
2509 2509 ret = int(inpart.params[b'return'])
2510 2510 partid = int(inpart.params[b'in-reply-to'])
2511 2511 op.records.add(b'pushkey', {b'return': ret}, partid)
2512 2512
2513 2513
2514 2514 @parthandler(b'obsmarkers')
2515 2515 def handleobsmarker(op, inpart):
2516 2516 """add a stream of obsmarkers to the repo"""
2517 2517 tr = op.gettransaction()
2518 2518 markerdata = inpart.read()
2519 2519 if op.ui.config(b'experimental', b'obsmarkers-exchange-debug'):
2520 2520 op.ui.writenoi18n(
2521 2521 b'obsmarker-exchange: %i bytes received\n' % len(markerdata)
2522 2522 )
2523 2523 # The mergemarkers call will crash if marker creation is not enabled.
2524 2524 # we want to avoid this if the part is advisory.
2525 2525 if not inpart.mandatory and op.repo.obsstore.readonly:
2526 2526 op.repo.ui.debug(
2527 2527 b'ignoring obsolescence markers, feature not enabled\n'
2528 2528 )
2529 2529 return
2530 2530 new = op.repo.obsstore.mergemarkers(tr, markerdata)
2531 2531 op.repo.invalidatevolatilesets()
2532 2532 op.records.add(b'obsmarkers', {b'new': new})
2533 2533 if op.reply is not None:
2534 2534 rpart = op.reply.newpart(b'reply:obsmarkers')
2535 2535 rpart.addparam(
2536 2536 b'in-reply-to', pycompat.bytestr(inpart.id), mandatory=False
2537 2537 )
2538 2538 rpart.addparam(b'new', b'%i' % new, mandatory=False)
2539 2539
2540 2540
2541 2541 @parthandler(b'reply:obsmarkers', (b'new', b'in-reply-to'))
2542 2542 def handleobsmarkerreply(op, inpart):
2543 2543 """retrieve the result of an obsmarkers request"""
2544 2544 ret = int(inpart.params[b'new'])
2545 2545 partid = int(inpart.params[b'in-reply-to'])
2546 2546 op.records.add(b'obsmarkers', {b'new': ret}, partid)
2547 2547
2548 2548
2549 2549 @parthandler(b'hgtagsfnodes')
2550 2550 def handlehgtagsfnodes(op, inpart):
2551 2551 """Applies .hgtags fnodes cache entries to the local repo.
2552 2552
2553 2553 Payload is pairs of 20 byte changeset nodes and filenodes.
2554 2554 """
2555 2555 # Grab the transaction so we ensure that we have the lock at this point.
2556 2556 if op.ui.configbool(b'experimental', b'bundle2lazylocking'):
2557 2557 op.gettransaction()
2558 2558 cache = tags.hgtagsfnodescache(op.repo.unfiltered())
2559 2559
2560 2560 count = 0
2561 2561 while True:
2562 2562 node = inpart.read(20)
2563 2563 fnode = inpart.read(20)
2564 2564 if len(node) < 20 or len(fnode) < 20:
2565 2565 op.ui.debug(b'ignoring incomplete received .hgtags fnodes data\n')
2566 2566 break
2567 2567 cache.setfnode(node, fnode)
2568 2568 count += 1
2569 2569
2570 2570 cache.write()
2571 2571 op.ui.debug(b'applied %i hgtags fnodes cache entries\n' % count)
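The payload format stated in the docstring (pairs of 20-byte changeset nodes and filenodes) can be parsed as follows. This sketch uses `io.BytesIO` in place of the part object, and `iter_fnode_pairs` is a hypothetical name.

```python
import io

NODE_LEN = 20


def iter_fnode_pairs(stream):
    """Yield (node, fnode) pairs; stop at the first incomplete pair."""
    while True:
        node = stream.read(NODE_LEN)
        fnode = stream.read(NODE_LEN)
        if len(node) < NODE_LEN or len(fnode) < NODE_LEN:
            break
        yield node, fnode


payload = (
    b'\x01' * 20 + b'\xaa' * 20
    + b'\x02' * 20 + b'\xbb' * 20
    + b'\x03' * 5  # truncated trailing data is dropped
)
pairs = list(iter_fnode_pairs(io.BytesIO(payload)))
```

As in the handler, a truncated trailing pair is dropped and is not treated as an error.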
2572 2572
2573 2573
2574 2574 rbcstruct = struct.Struct(b'>III')
2575 2575
2576 2576
2577 2577 @parthandler(b'cache:rev-branch-cache')
2578 2578 def handlerbc(op, inpart):
2579 2579 """Legacy part, ignored for compatibility with bundles from or
2580 2580 for Mercurial before 5.7. Newer Mercurial computes the cache
2581 2581 efficiently enough during unbundling that the additional transfer
2582 2582 is unnecessary."""
2583 2583
2584 2584
2585 2585 @parthandler(b'pushvars')
2586 2586 def bundle2getvars(op, part):
2587 2587 '''unbundle a bundle2 containing shellvars on the server'''
2588 2588 # An option to disable unbundling on server-side for security reasons
2589 2589 if op.ui.configbool(b'push', b'pushvars.server'):
2590 2590 hookargs = {}
2591 2591 for key, value in part.advisoryparams:
2592 2592 key = key.upper()
2593 2593 # We want pushed variables to have USERVAR_ prepended so we know
2594 2594 # they came from the --pushvar flag.
2595 2595 key = b"USERVAR_" + key
2596 2596 hookargs[key] = value
2597 2597 op.addhookargs(hookargs)
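
The key transformation applied to pushed variables can be shown in isolation. This is a sketch under the assumption that advisory parameters arrive as an iterable of `(key, value)` byte-string pairs, as the loop above suggests; `pushvars_to_hookargs` is a hypothetical helper, not a Mercurial function.

```python
def pushvars_to_hookargs(advisoryparams):
    """Upper-case each key and prefix it with USERVAR_, as the handler does."""
    hookargs = {}
    for key, value in advisoryparams:
        hookargs[b"USERVAR_" + key.upper()] = value
    return hookargs


example = pushvars_to_hookargs([(b'debug', b'1'), (b'Reason', b'hotfix')])
```

The prefix lets hooks tell user-supplied variables apart from the other hook arguments.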
2598 2598
2599 2599
2600 2600 @parthandler(b'stream2', (b'requirements', b'filecount', b'bytecount'))
2601 2601 def handlestreamv2bundle(op, part):
2602 2602
2603 2603 requirements = urlreq.unquote(part.params[b'requirements'])
2604 2604 requirements = requirements.split(b',') if requirements else []
2605 2605 filecount = int(part.params[b'filecount'])
2606 2606 bytecount = int(part.params[b'bytecount'])
2607 2607
2608 2608 repo = op.repo
2609 2609 if len(repo):
2610 2610 msg = _(b'cannot apply stream clone to non empty repository')
2611 2611 raise error.Abort(msg)
2612 2612
2613 2613 repo.ui.debug(b'applying stream bundle\n')
2614 2614 streamclone.applybundlev2(repo, part, filecount, bytecount, requirements)
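
The decoding of the `requirements` parameter can be sketched with the standard library alone. This assumes that `urlreq.unquote` behaves like `urllib.parse.unquote_to_bytes` on byte strings, which is an assumption made for this illustration; `decode_requirements` is a hypothetical name.

```python
from urllib.parse import quote_from_bytes, unquote_to_bytes


def decode_requirements(raw):
    """Unquote a comma-separated requirements value into a list of byte strings."""
    requirements = unquote_to_bytes(raw)
    # An empty value must give an empty list; b''.split(b',') would give [b''].
    return requirements.split(b',') if requirements else []


encoded = quote_from_bytes(b'revlogv1,store,fncache').encode('ascii')
decoded = decode_requirements(encoded)
empty = decode_requirements(b'')
```

The conditional is the reason the handlers do not call `split` unconditionally: an empty parameter would otherwise produce a single empty requirement.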
2615 2615
2616 2616
2617 2617 @parthandler(b'stream3-exp', (b'requirements',))
2618 2618 def handlestreamv3bundle(op, part):
2619 2619 requirements = urlreq.unquote(part.params[b'requirements'])
2620 2620 requirements = requirements.split(b',') if requirements else []
2621 2621
2622 2622 repo = op.repo
2623 2623 if len(repo):
2624 2624 msg = _(b'cannot apply stream clone to non empty repository')
2625 2625 raise error.Abort(msg)
2626 2626
2627 2627 repo.ui.debug(b'applying stream bundle\n')
2628 2628 streamclone.applybundlev3(repo, part, requirements)
2629 2629
2630 2630
2631 2631 def widen_bundle(
2632 2632 bundler, repo, oldmatcher, newmatcher, common, known, cgversion, ellipses
2633 2633 ):
2634 2634 """generates bundle2 for widening a narrow clone
2635 2635
2636 2636 bundler is the bundle to which data should be added
2637 2637 repo is the localrepository instance
2638 2638 oldmatcher matches what the client already has
2639 2639 newmatcher matches what the client needs (including what it already has)
2640 2640 common is the set of common heads between server and client
2641 2641 known is a set of revs known on the client side (used in ellipses)
2642 2642 cgversion is the changegroup version to send
2643 2643 ellipses is a boolean telling whether to send ellipses data
2644 2644
2645 2645 returns the bundle2 bundler, with the data required for widening added
2646 2646 """
2647 2647 commonnodes = set()
2648 2648 cl = repo.changelog
2649 2649 for r in repo.revs(b"::%ln", common):
2650 2650 commonnodes.add(cl.node(r))
2651 2651 if commonnodes:
2652 2652 packer = changegroup.getbundler(
2653 2653 cgversion,
2654 2654 repo,
2655 2655 oldmatcher=oldmatcher,
2656 2656 matcher=newmatcher,
2657 2657 fullnodes=commonnodes,
2658 2658 )
2659 2659 cgdata = packer.generate(
2660 2660 {repo.nullid},
2661 2661 list(commonnodes),
2662 2662 False,
2663 2663 b'narrow_widen',
2664 2664 changelog=False,
2665 2665 )
2666 2666
2667 2667 part = bundler.newpart(b'changegroup', data=cgdata)
2668 2668 part.addparam(b'version', cgversion)
2669 2669 if scmutil.istreemanifest(repo):
2670 2670 part.addparam(b'treemanifest', b'1')
2671 2671 if repository.REPO_FEATURE_SIDE_DATA in repo.features:
2672 2672 part.addparam(b'exp-sidedata', b'1')
2673 2673 wanted = format_remote_wanted_sidedata(repo)
2674 2674 part.addparam(b'exp-wanted-sidedata', wanted)
2675 2675
2676 2676 return bundler