##// END OF EJS Templates
eol: separate .hgeol parsing from merge in ui...
Patrick Mezard -
r13613:85b80261 default
parent child Browse files
Show More
@@ -1,299 +1,303
1 """automatically manage newlines in repository files
1 """automatically manage newlines in repository files
2
2
3 This extension allows you to manage the type of line endings (CRLF or
3 This extension allows you to manage the type of line endings (CRLF or
4 LF) that are used in the repository and in the local working
4 LF) that are used in the repository and in the local working
5 directory. That way you can get CRLF line endings on Windows and LF on
5 directory. That way you can get CRLF line endings on Windows and LF on
6 Unix/Mac, thereby letting everybody use their OS native line endings.
6 Unix/Mac, thereby letting everybody use their OS native line endings.
7
7
8 The extension reads its configuration from a versioned ``.hgeol``
8 The extension reads its configuration from a versioned ``.hgeol``
9 configuration file found in the root of the working copy. The
9 configuration file found in the root of the working copy. The
10 ``.hgeol`` file use the same syntax as all other Mercurial
10 ``.hgeol`` file use the same syntax as all other Mercurial
11 configuration files. It uses two sections, ``[patterns]`` and
11 configuration files. It uses two sections, ``[patterns]`` and
12 ``[repository]``.
12 ``[repository]``.
13
13
14 The ``[patterns]`` section specifies how line endings should be
14 The ``[patterns]`` section specifies how line endings should be
15 converted between the working copy and the repository. The format is
15 converted between the working copy and the repository. The format is
16 specified by a file pattern. The first match is used, so put more
16 specified by a file pattern. The first match is used, so put more
17 specific patterns first. The available line endings are ``LF``,
17 specific patterns first. The available line endings are ``LF``,
18 ``CRLF``, and ``BIN``.
18 ``CRLF``, and ``BIN``.
19
19
20 Files with the declared format of ``CRLF`` or ``LF`` are always
20 Files with the declared format of ``CRLF`` or ``LF`` are always
21 checked out and stored in the repository in that format and files
21 checked out and stored in the repository in that format and files
22 declared to be binary (``BIN``) are left unchanged. Additionally,
22 declared to be binary (``BIN``) are left unchanged. Additionally,
23 ``native`` is an alias for checking out in the platform's default line
23 ``native`` is an alias for checking out in the platform's default line
24 ending: ``LF`` on Unix (including Mac OS X) and ``CRLF`` on
24 ending: ``LF`` on Unix (including Mac OS X) and ``CRLF`` on
25 Windows. Note that ``BIN`` (do nothing to line endings) is Mercurial's
25 Windows. Note that ``BIN`` (do nothing to line endings) is Mercurial's
26 default behaviour; it is only needed if you need to override a later,
26 default behaviour; it is only needed if you need to override a later,
27 more general pattern.
27 more general pattern.
28
28
29 The optional ``[repository]`` section specifies the line endings to
29 The optional ``[repository]`` section specifies the line endings to
30 use for files stored in the repository. It has a single setting,
30 use for files stored in the repository. It has a single setting,
31 ``native``, which determines the storage line endings for files
31 ``native``, which determines the storage line endings for files
32 declared as ``native`` in the ``[patterns]`` section. It can be set to
32 declared as ``native`` in the ``[patterns]`` section. It can be set to
33 ``LF`` or ``CRLF``. The default is ``LF``. For example, this means
33 ``LF`` or ``CRLF``. The default is ``LF``. For example, this means
34 that on Windows, files configured as ``native`` (``CRLF`` by default)
34 that on Windows, files configured as ``native`` (``CRLF`` by default)
35 will be converted to ``LF`` when stored in the repository. Files
35 will be converted to ``LF`` when stored in the repository. Files
36 declared as ``LF``, ``CRLF``, or ``BIN`` in the ``[patterns]`` section
36 declared as ``LF``, ``CRLF``, or ``BIN`` in the ``[patterns]`` section
37 are always stored as-is in the repository.
37 are always stored as-is in the repository.
38
38
39 Example versioned ``.hgeol`` file::
39 Example versioned ``.hgeol`` file::
40
40
41 [patterns]
41 [patterns]
42 **.py = native
42 **.py = native
43 **.vcproj = CRLF
43 **.vcproj = CRLF
44 **.txt = native
44 **.txt = native
45 Makefile = LF
45 Makefile = LF
46 **.jpg = BIN
46 **.jpg = BIN
47
47
48 [repository]
48 [repository]
49 native = LF
49 native = LF
50
50
51 .. note::
51 .. note::
52 The rules will first apply when files are touched in the working
52 The rules will first apply when files are touched in the working
53 copy, e.g. by updating to null and back to tip to touch all files.
53 copy, e.g. by updating to null and back to tip to touch all files.
54
54
55 The extension uses an optional ``[eol]`` section in your hgrc file
55 The extension uses an optional ``[eol]`` section in your hgrc file
56 (not the ``.hgeol`` file) for settings that control the overall
56 (not the ``.hgeol`` file) for settings that control the overall
57 behavior. There are two settings:
57 behavior. There are two settings:
58
58
59 - ``eol.native`` (default ``os.linesep``) can be set to ``LF`` or
59 - ``eol.native`` (default ``os.linesep``) can be set to ``LF`` or
60 ``CRLF`` to override the default interpretation of ``native`` for
60 ``CRLF`` to override the default interpretation of ``native`` for
61 checkout. This can be used with :hg:`archive` on Unix, say, to
61 checkout. This can be used with :hg:`archive` on Unix, say, to
62 generate an archive where files have line endings for Windows.
62 generate an archive where files have line endings for Windows.
63
63
64 - ``eol.only-consistent`` (default True) can be set to False to make
64 - ``eol.only-consistent`` (default True) can be set to False to make
65 the extension convert files with inconsistent EOLs. Inconsistent
65 the extension convert files with inconsistent EOLs. Inconsistent
66 means that there is both ``CRLF`` and ``LF`` present in the file.
66 means that there is both ``CRLF`` and ``LF`` present in the file.
67 Such files are normally not touched under the assumption that they
67 Such files are normally not touched under the assumption that they
68 have mixed EOLs on purpose.
68 have mixed EOLs on purpose.
69
69
70 The extension provides ``cleverencode:`` and ``cleverdecode:`` filters
70 The extension provides ``cleverencode:`` and ``cleverdecode:`` filters
71 like the deprecated win32text extension does. This means that you can
71 like the deprecated win32text extension does. This means that you can
72 disable win32text and enable eol and your filters will still work. You
72 disable win32text and enable eol and your filters will still work. You
73 only need to these filters until you have prepared a ``.hgeol`` file.
73 only need to these filters until you have prepared a ``.hgeol`` file.
74
74
75 The ``win32text.forbid*`` hooks provided by the win32text extension
75 The ``win32text.forbid*`` hooks provided by the win32text extension
76 have been unified into a single hook named ``eol.hook``. The hook will
76 have been unified into a single hook named ``eol.hook``. The hook will
77 lookup the expected line endings from the ``.hgeol`` file, which means
77 lookup the expected line endings from the ``.hgeol`` file, which means
78 you must migrate to a ``.hgeol`` file first before using the hook.
78 you must migrate to a ``.hgeol`` file first before using the hook.
79 Remember to enable the eol extension in the repository where you
79 Remember to enable the eol extension in the repository where you
80 install the hook.
80 install the hook.
81
81
82 See :hg:`help patterns` for more information about the glob patterns
82 See :hg:`help patterns` for more information about the glob patterns
83 used.
83 used.
84 """
84 """
85
85
86 from mercurial.i18n import _
86 from mercurial.i18n import _
87 from mercurial import util, config, extensions, match, error
87 from mercurial import util, config, extensions, match, error
88 import re, os
88 import re, os
89
89
90 # Matches a lone LF, i.e., one that is not part of CRLF.
90 # Matches a lone LF, i.e., one that is not part of CRLF.
91 singlelf = re.compile('(^|[^\r])\n')
91 singlelf = re.compile('(^|[^\r])\n')
92 # Matches a single EOL which can either be a CRLF where repeated CR
92 # Matches a single EOL which can either be a CRLF where repeated CR
93 # are removed or a LF. We do not care about old Machintosh files, so a
93 # are removed or a LF. We do not care about old Machintosh files, so a
94 # stray CR is an error.
94 # stray CR is an error.
95 eolre = re.compile('\r*\n')
95 eolre = re.compile('\r*\n')
96
96
97
97
98 def inconsistenteol(data):
98 def inconsistenteol(data):
99 return '\r\n' in data and singlelf.search(data)
99 return '\r\n' in data and singlelf.search(data)
100
100
101 def tolf(s, params, ui, **kwargs):
101 def tolf(s, params, ui, **kwargs):
102 """Filter to convert to LF EOLs."""
102 """Filter to convert to LF EOLs."""
103 if util.binary(s):
103 if util.binary(s):
104 return s
104 return s
105 if ui.configbool('eol', 'only-consistent', True) and inconsistenteol(s):
105 if ui.configbool('eol', 'only-consistent', True) and inconsistenteol(s):
106 return s
106 return s
107 return eolre.sub('\n', s)
107 return eolre.sub('\n', s)
108
108
109 def tocrlf(s, params, ui, **kwargs):
109 def tocrlf(s, params, ui, **kwargs):
110 """Filter to convert to CRLF EOLs."""
110 """Filter to convert to CRLF EOLs."""
111 if util.binary(s):
111 if util.binary(s):
112 return s
112 return s
113 if ui.configbool('eol', 'only-consistent', True) and inconsistenteol(s):
113 if ui.configbool('eol', 'only-consistent', True) and inconsistenteol(s):
114 return s
114 return s
115 return eolre.sub('\r\n', s)
115 return eolre.sub('\r\n', s)
116
116
117 def isbinary(s, params):
117 def isbinary(s, params):
118 """Filter to do nothing with the file."""
118 """Filter to do nothing with the file."""
119 return s
119 return s
120
120
121 filters = {
121 filters = {
122 'to-lf': tolf,
122 'to-lf': tolf,
123 'to-crlf': tocrlf,
123 'to-crlf': tocrlf,
124 'is-binary': isbinary,
124 'is-binary': isbinary,
125 # The following provide backwards compatibility with win32text
125 # The following provide backwards compatibility with win32text
126 'cleverencode:': tolf,
126 'cleverencode:': tolf,
127 'cleverdecode:': tocrlf
127 'cleverdecode:': tocrlf
128 }
128 }
129
129
130 class eolfile(object):
131 def __init__(self, ui, root, data):
132 self._decode = {'LF': 'to-lf', 'CRLF': 'to-crlf', 'BIN': 'is-binary'}
133 self._encode = {'LF': 'to-lf', 'CRLF': 'to-crlf', 'BIN': 'is-binary'}
134
135 self.cfg = config.config()
136 # Our files should not be touched. The pattern must be
137 # inserted first override a '** = native' pattern.
138 self.cfg.set('patterns', '.hg*', 'BIN')
139 # We can then parse the user's patterns.
140 self.cfg.parse('.hgeol', data)
141
142 isrepolf = self.cfg.get('repository', 'native') != 'CRLF'
143 self._encode['NATIVE'] = isrepolf and 'to-lf' or 'to-crlf'
144 iswdlf = ui.config('eol', 'native', os.linesep) in ('LF', '\n')
145 self._decode['NATIVE'] = iswdlf and 'to-lf' or 'to-crlf'
146
147 include = []
148 exclude = []
149 for pattern, style in self.cfg.items('patterns'):
150 key = style.upper()
151 if key == 'BIN':
152 exclude.append(pattern)
153 else:
154 include.append(pattern)
155 # This will match the files for which we need to care
156 # about inconsistent newlines.
157 self.match = match.match(root, '', [], include, exclude)
158
159 def setfilters(self, ui):
160 for pattern, style in self.cfg.items('patterns'):
161 key = style.upper()
162 try:
163 ui.setconfig('decode', pattern, self._decode[key])
164 ui.setconfig('encode', pattern, self._encode[key])
165 except KeyError:
166 ui.warn(_("ignoring unknown EOL style '%s' from %s\n")
167 % (style, self.cfg.source('patterns', pattern)))
168
169 def parseeol(ui, repo, node=None):
170 try:
171 if node is None:
172 # Cannot use workingctx.data() since it would load
173 # and cache the filters before we configure them.
174 data = repo.wfile('.hgeol').read()
175 else:
176 data = repo[node]['.hgeol'].data()
177 return eolfile(ui, repo.root, data)
178 except (IOError, LookupError):
179 return None
130
180
131 def hook(ui, repo, node, hooktype, **kwargs):
181 def hook(ui, repo, node, hooktype, **kwargs):
132 """verify that files have expected EOLs"""
182 """verify that files have expected EOLs"""
133 files = set()
183 files = set()
134 for rev in xrange(repo[node].rev(), len(repo)):
184 for rev in xrange(repo[node].rev(), len(repo)):
135 files.update(repo[rev].files())
185 files.update(repo[rev].files())
136 tip = repo['tip']
186 tip = repo['tip']
137 for f in files:
187 for f in files:
138 if f not in tip:
188 if f not in tip:
139 continue
189 continue
140 for pattern, target in ui.configitems('encode'):
190 for pattern, target in ui.configitems('encode'):
141 if match.match(repo.root, '', [pattern])(f):
191 if match.match(repo.root, '', [pattern])(f):
142 data = tip[f].data()
192 data = tip[f].data()
143 if target == "to-lf" and "\r\n" in data:
193 if target == "to-lf" and "\r\n" in data:
144 raise util.Abort(_("%s should not have CRLF line endings")
194 raise util.Abort(_("%s should not have CRLF line endings")
145 % f)
195 % f)
146 elif target == "to-crlf" and singlelf.search(data):
196 elif target == "to-crlf" and singlelf.search(data):
147 raise util.Abort(_("%s should not have LF line endings")
197 raise util.Abort(_("%s should not have LF line endings")
148 % f)
198 % f)
149 # Ignore other rules for this file
199 # Ignore other rules for this file
150 break
200 break
151
201
152
202
153 def preupdate(ui, repo, hooktype, parent1, parent2):
203 def preupdate(ui, repo, hooktype, parent1, parent2):
154 #print "preupdate for %s: %s -> %s" % (repo.root, parent1, parent2)
204 #print "preupdate for %s: %s -> %s" % (repo.root, parent1, parent2)
155 try:
205 try:
156 repo.readhgeol(parent1)
206 repo.loadeol(parent1)
157 except error.ParseError, inst:
207 except error.ParseError, inst:
158 ui.warn(_("warning: ignoring .hgeol file due to parse error "
208 ui.warn(_("warning: ignoring .hgeol file due to parse error "
159 "at %s: %s\n") % (inst.args[1], inst.args[0]))
209 "at %s: %s\n") % (inst.args[1], inst.args[0]))
160 return False
210 return False
161
211
162 def uisetup(ui):
212 def uisetup(ui):
163 ui.setconfig('hooks', 'preupdate.eol', preupdate)
213 ui.setconfig('hooks', 'preupdate.eol', preupdate)
164
214
165 def extsetup(ui):
215 def extsetup(ui):
166 try:
216 try:
167 extensions.find('win32text')
217 extensions.find('win32text')
168 raise util.Abort(_("the eol extension is incompatible with the "
218 raise util.Abort(_("the eol extension is incompatible with the "
169 "win32text extension"))
219 "win32text extension"))
170 except KeyError:
220 except KeyError:
171 pass
221 pass
172
222
173
223
174 def reposetup(ui, repo):
224 def reposetup(ui, repo):
175 uisetup(repo.ui)
225 uisetup(repo.ui)
176 #print "reposetup for", repo.root
226 #print "reposetup for", repo.root
177
227
178 if not repo.local():
228 if not repo.local():
179 return
229 return
180 for name, fn in filters.iteritems():
230 for name, fn in filters.iteritems():
181 repo.adddatafilter(name, fn)
231 repo.adddatafilter(name, fn)
182
232
183 ui.setconfig('patch', 'eol', 'auto')
233 ui.setconfig('patch', 'eol', 'auto')
184
234
185 class eolrepo(repo.__class__):
235 class eolrepo(repo.__class__):
186
236
187 _decode = {'LF': 'to-lf', 'CRLF': 'to-crlf', 'BIN': 'is-binary'}
237 def loadeol(self, node=None):
188 _encode = {'LF': 'to-lf', 'CRLF': 'to-crlf', 'BIN': 'is-binary'}
238 eol = parseeol(self.ui, self, node)
189
239 if eol is None:
190 def readhgeol(self, node=None):
191 try:
192 if node is None:
193 # Cannot use workingctx.data() since it would load
194 # and cache the filters before we configure them.
195 data = self.wfile('.hgeol').read()
196 else:
197 data = self[node]['.hgeol'].data()
198 except (IOError, LookupError):
199 return None
240 return None
200
241 eol.setfilters(self.ui)
201 if self.ui.config('eol', 'native', os.linesep) in ('LF', '\n'):
242 return eol.match
202 self._decode['NATIVE'] = 'to-lf'
203 else:
204 self._decode['NATIVE'] = 'to-crlf'
205
206 eol = config.config()
207 # Our files should not be touched. The pattern must be
208 # inserted first override a '** = native' pattern.
209 eol.set('patterns', '.hg*', 'BIN')
210 # We can then parse the user's patterns.
211 eol.parse('.hgeol', data)
212
213 if eol.get('repository', 'native') == 'CRLF':
214 self._encode['NATIVE'] = 'to-crlf'
215 else:
216 self._encode['NATIVE'] = 'to-lf'
217
218 for pattern, style in eol.items('patterns'):
219 key = style.upper()
220 try:
221 self.ui.setconfig('decode', pattern, self._decode[key])
222 self.ui.setconfig('encode', pattern, self._encode[key])
223 except KeyError:
224 self.ui.warn(_("ignoring unknown EOL style '%s' from %s\n")
225 % (style, eol.source('patterns', pattern)))
226
227 include = []
228 exclude = []
229 for pattern, style in eol.items('patterns'):
230 key = style.upper()
231 if key == 'BIN':
232 exclude.append(pattern)
233 else:
234 include.append(pattern)
235
236 # This will match the files for which we need to care
237 # about inconsistent newlines.
238 return match.match(self.root, '', [], include, exclude)
239
243
240 def _hgcleardirstate(self):
244 def _hgcleardirstate(self):
241 try:
245 try:
242 self._eolfile = self.readhgeol() or self.readhgeol('tip')
246 self._eolfile = (self.loadeol() or self.loadeol('tip'))
243 except error.ParseError, inst:
247 except error.ParseError, inst:
244 ui.warn(_("warning: ignoring .hgeol file due to parse error "
248 ui.warn(_("warning: ignoring .hgeol file due to parse error "
245 "at %s: %s\n") % (inst.args[1], inst.args[0]))
249 "at %s: %s\n") % (inst.args[1], inst.args[0]))
246 self._eolfile = None
250 self._eolfile = None
247
251
248 if not self._eolfile:
252 if not self._eolfile:
249 self._eolfile = util.never
253 self._eolfile = util.never
250 return
254 return
251
255
252 try:
256 try:
253 cachemtime = os.path.getmtime(self.join("eol.cache"))
257 cachemtime = os.path.getmtime(self.join("eol.cache"))
254 except OSError:
258 except OSError:
255 cachemtime = 0
259 cachemtime = 0
256
260
257 try:
261 try:
258 eolmtime = os.path.getmtime(self.wjoin(".hgeol"))
262 eolmtime = os.path.getmtime(self.wjoin(".hgeol"))
259 except OSError:
263 except OSError:
260 eolmtime = 0
264 eolmtime = 0
261
265
262 if eolmtime > cachemtime:
266 if eolmtime > cachemtime:
263 ui.debug("eol: detected change in .hgeol\n")
267 ui.debug("eol: detected change in .hgeol\n")
264 wlock = None
268 wlock = None
265 try:
269 try:
266 wlock = self.wlock()
270 wlock = self.wlock()
267 for f in self.dirstate:
271 for f in self.dirstate:
268 if self.dirstate[f] == 'n':
272 if self.dirstate[f] == 'n':
269 # all normal files need to be looked at
273 # all normal files need to be looked at
270 # again since the new .hgeol file might no
274 # again since the new .hgeol file might no
271 # longer match a file it matched before
275 # longer match a file it matched before
272 self.dirstate.normallookup(f)
276 self.dirstate.normallookup(f)
273 # Touch the cache to update mtime.
277 # Touch the cache to update mtime.
274 self.opener("eol.cache", "w").close()
278 self.opener("eol.cache", "w").close()
275 wlock.release()
279 wlock.release()
276 except error.LockUnavailable:
280 except error.LockUnavailable:
277 # If we cannot lock the repository and clear the
281 # If we cannot lock the repository and clear the
278 # dirstate, then a commit might not see all files
282 # dirstate, then a commit might not see all files
279 # as modified. But if we cannot lock the
283 # as modified. But if we cannot lock the
280 # repository, then we can also not make a commit,
284 # repository, then we can also not make a commit,
281 # so ignore the error.
285 # so ignore the error.
282 pass
286 pass
283
287
284 def commitctx(self, ctx, error=False):
288 def commitctx(self, ctx, error=False):
285 for f in sorted(ctx.added() + ctx.modified()):
289 for f in sorted(ctx.added() + ctx.modified()):
286 if not self._eolfile(f):
290 if not self._eolfile(f):
287 continue
291 continue
288 data = ctx[f].data()
292 data = ctx[f].data()
289 if util.binary(data):
293 if util.binary(data):
290 # We should not abort here, since the user should
294 # We should not abort here, since the user should
291 # be able to say "** = native" to automatically
295 # be able to say "** = native" to automatically
292 # have all non-binary files taken care of.
296 # have all non-binary files taken care of.
293 continue
297 continue
294 if inconsistenteol(data):
298 if inconsistenteol(data):
295 raise util.Abort(_("inconsistent newline style "
299 raise util.Abort(_("inconsistent newline style "
296 "in %s\n" % f))
300 "in %s\n" % f))
297 return super(eolrepo, self).commitctx(ctx, error)
301 return super(eolrepo, self).commitctx(ctx, error)
298 repo.__class__ = eolrepo
302 repo.__class__ = eolrepo
299 repo._hgcleardirstate()
303 repo._hgcleardirstate()
General Comments 0
You need to be logged in to leave comments. Login now