974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
1 |
============================
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
2 |
Guidelines for modifying bzr |
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
3 |
============================
|
4 |
||
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
5 |
.. contents:: |
6 |
||
7 |
(The current version of this document is available in the file ``HACKING`` |
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
8 |
in the source tree, or at http://doc.bazaar-vcs.org/current/hacking.html) |
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
9 |
|
10 |
Overall
|
|
11 |
=======
|
|
12 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
13 |
* New functionality should have test cases. Preferably write the |
14 |
test before writing the code. |
|
15 |
||
16 |
In general, you can test at either the command-line level or the |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
17 |
internal API level. See Writing Tests below for more detail. |
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
18 |
|
1185.33.48
by Martin Pool
Hacking notes on TDD |
19 |
* Try to practice Test-Driven Development. before fixing a bug, write a |
20 |
test case so that it does not regress. Similarly for adding a new |
|
21 |
feature: write a test case for a small version of the new feature before |
|
22 |
starting on the code itself. Check the test fails on the old code, then |
|
23 |
add the feature or fix and check it passes. |
|
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
24 |
|
25 |
* Exceptions should be defined inside bzrlib.errors, so that we can |
|
26 |
see the whole tree at a glance. |
|
27 |
||
28 |
* Imports should be done at the top-level of the file, unless there is |
|
29 |
a strong reason to have them lazily loaded when a particular |
|
30 |
function runs. Import statements have a cost, so try to make sure |
|
31 |
they don't run inside hot functions. |
|
32 |
||
33 |
* Module names should always be given fully-qualified,
|
|
34 |
i.e. ``bzrlib.hashcache`` not just ``hashcache``.
|
|
35 |
||
1185.33.48
by Martin Pool
Hacking notes on TDD |
36 |
* Commands should return non-zero when they encounter circumstances that
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
37 |
the user should really pay attention to - which includes trivial shell
|
38 |
pipelines.
|
|
39 |
||
1185.34.1
by Jelmer Vernooij
Fix a couple of typo's |
40 |
Recommended values are
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
41 |
0. OK,
|
42 |
1. Conflicts in merge-like operations, or changes are present in
|
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
43 |
diff-like operations.
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
44 |
2. Unrepresentable diff changes (i.e. binary files that we cannot show
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
45 |
a diff of).
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
46 |
3. An error or exception has occurred.
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
47 |
|
1393.1.54
by Martin Pool
- more hacking notes on evolving interfaces |
48 |
Evolving interfaces
|
49 |
-------------------
|
|
50 |
||
1534.2.4
by Robert Collins
Update NEWS and HACKING for the symbol_versioning module. |
51 |
We have a commitment to 6 months API stability - any supported symbol in a
|
52 |
release of bzr MUST NOT be altered in any way that would result in
|
|
53 |
breaking existing code that uses it. That means that method names,
|
|
54 |
parameter ordering, parameter names, variable and attribute names etc must
|
|
55 |
not be changed without leaving a 'deprecated forwarder' behind. This even |
|
56 |
applies to modules and classes.
|
|
57 |
||
58 |
If you wish to change the behaviour of a supported API in an incompatible
|
|
59 |
way, you need to change its name as well. For instance, if I add a optional keyword
|
|
60 |
parameter to branch.commit - that's fine. On the other hand, if I add a |
|
61 |
keyword parameter to branch.commit which is a *required* transaction |
|
62 |
object, I should rename the API - i.e. to 'branch.commit_transaction'. |
|
63 |
||
64 |
When renaming such supported API's, be sure to leave a deprecated_method (or |
|
65 |
_function or ...) behind which forwards to the new API. See the
|
|
66 |
bzrlib.symbol_versioning module for decorators that take care of the
|
|
67 |
details for you - such as updating the docstring, and issuing a warning
|
|
68 |
when the old api is used.
|
|
69 |
||
70 |
For unsupported API's, it does not hurt to follow this discipline, but its |
|
71 |
not required. Minimally though, please try to rename things so that |
|
72 |
callers will at least get an AttributeError rather than weird results. |
|
73 |
||
1393.1.54
by Martin Pool
- more hacking notes on evolving interfaces |
74 |
|
1534.3.1
by Robert Collins
* bzrlib.osutils.safe_unicode now exists to provide parameter coercion |
75 |
Standard parameter types |
76 |
------------------------
|
|
77 |
||
78 |
There are some common requirements in the library: some parameters need to be |
|
79 |
unicode safe, some need byte strings, and so on. At the moment we have |
|
80 |
only codified one specific pattern: Parameters that need to be unicode |
|
81 |
should be check via 'bzrlib.osutils.safe_unicode'. This will coerce the |
|
82 |
input into unicode in a consistent fashion, allowing trivial strings to be |
|
83 |
used for programmer convenience, but not performing unpredictably in the |
|
84 |
presence of different locales. |
|
85 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
86 |
Documentation
|
87 |
=============
|
|
88 |
||
89 |
If you change the behaviour of a command, please update its docstring |
|
90 |
in bzrlib/commands.py. This is displayed by the 'bzr help' command. |
|
91 |
||
1185.33.2
by Martin Pool
How to maintain the NEWS file |
92 |
NEWS file |
93 |
---------
|
|
94 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
95 |
If you make a user-visible change, please add a note to the NEWS file. |
96 |
The description should be written to make sense to someone who's just |
|
97 |
a user of bzr, not a developer: new functions or classes shouldn't be |
|
98 |
mentioned, but new commands, changes in behaviour or fixed nontrivial |
|
99 |
bugs should be listed. See the existing entries for an idea of what |
|
100 |
should be done. |
|
1098
by Martin Pool
- notes on how output is written |
101 |
|
1185.33.2
by Martin Pool
How to maintain the NEWS file |
102 |
Within each release, entries in the news file should have the most |
103 |
user-visible changes first. So the order should be approximately: |
|
104 |
||
105 |
* changes to existing behaviour - the highest priority because the |
|
106 |
user's existing knowledge is incorrect |
|
107 |
* new features - should be brought to their attention
|
|
108 |
* bug fixes - may be of interest if the bug was affecting them, and
|
|
109 |
should include the bug number if any
|
|
110 |
* major documentation changes
|
|
111 |
* changes to internal interfaces
|
|
112 |
||
113 |
People who made significant contributions to each change are listed in
|
|
114 |
parenthesis. This can include reporting bugs (particularly with good
|
|
115 |
details or reproduction recipes), submitting patches, etc.
|
|
1098
by Martin Pool
- notes on how output is written |
116 |
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
117 |
API documentation
|
118 |
-----------------
|
|
119 |
||
120 |
Functions, methods, classes and modules should have docstrings
|
|
121 |
describing how they are used.
|
|
122 |
||
123 |
The first line of the docstring should be a self-contained sentence.
|
|
124 |
||
125 |
For the special case of Command classes, this acts as the user-visible
|
|
126 |
documentation shown by the help command.
|
|
127 |
||
128 |
The docstrings should be formatted as reStructuredText_ (like this
|
|
129 |
document), suitable for processing using the epydoc_ tool into HTML
|
|
130 |
documentation.
|
|
131 |
||
132 |
.. _reStructuredText: http://docutils.sourceforge.net/rst.html
|
|
133 |
.. _epydoc: http://epydoc.sourceforge.net/
|
|
134 |
||
135 |
||
136 |
||
137 |
Coding style
|
|
138 |
============
|
|
139 |
||
140 |
Please write PEP-8__ compliant code.
|
|
141 |
||
142 |
One often-missed requirement is that the first line of docstrings
|
|
143 |
should be a self-contained one-sentence summary.
|
|
144 |
||
145 |
__ http://www.python.org/peps/pep-0008.html
|
|
146 |
||
147 |
||
148 |
||
149 |
Naming
|
|
150 |
------
|
|
151 |
||
152 |
Functions, methods or members that are in some sense "private" are given
|
|
153 |
a leading underscore prefix. This is just a hint that code outside the
|
|
154 |
implementation should probably not use that interface.
|
|
155 |
||
156 |
We prefer class names to be concatenated capital words (``TestCase``)
|
|
157 |
and variables, methods and functions to be lowercase words joined by
|
|
158 |
underscores (``revision_id``, ``get_revision``).
|
|
159 |
||
160 |
For the purposes of naming some names are treated as single compound
|
|
161 |
words: "filename", "revno".
|
|
162 |
||
163 |
Consider naming classes as nouns and functions/methods as verbs.
|
|
164 |
||
165 |
||
166 |
Standard names
|
|
167 |
--------------
|
|
168 |
||
169 |
``revision_id`` not ``rev_id`` or ``revid``
|
|
170 |
||
171 |
Functions that transform one thing to another should be named ``x_to_y``
|
|
172 |
(not ``x2y`` as occurs in some old code.)
|
|
173 |
||
1098
by Martin Pool
- notes on how output is written |
174 |
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
175 |
Destructors
|
176 |
-----------
|
|
177 |
||
1185.16.150
by Martin Pool
Improved description of python exception policies |
178 |
Python destructors (``__del__``) work differently to those of other
|
179 |
languages. In particular, bear in mind that destructors may be called
|
|
180 |
immediately when the object apparently becomes unreferenced, or at some
|
|
181 |
later time, or possibly never at all. Therefore we have restrictions on
|
|
182 |
what can be done inside them.
|
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
183 |
|
184 |
0. Never use a __del__ method without asking Martin/Robert first.
|
|
185 |
||
186 |
1. Never rely on a ``__del__`` method running. If there is code that
|
|
187 |
must run, do it from a ``finally`` block instead.
|
|
188 |
||
189 |
2. Never ``import`` from inside a ``__del__`` method, or you may crash the
|
|
190 |
interpreter!!
|
|
191 |
||
192 |
3. In some places we raise a warning from the destructor if the object
|
|
193 |
has not been cleaned up or closed. This is considered OK: the warning
|
|
194 |
may not catch every case but it's still useful sometimes. |
|
195 |
||
196 |
||
1740.2.5
by Aaron Bentley
Merge from bzr.dev |
197 |
Factories
|
198 |
---------
|
|
199 |
||
200 |
In some places we have variables which point to callables that construct |
|
201 |
new instances. That is to say, they can be used a lot like class objects, |
|
202 |
but they shouldn't be *named* like classes: |
|
203 |
||
204 |
> I think that things named FooBar should create instances of FooBar when
|
|
205 |
> called. Its plain confusing for them to do otherwise. When we have
|
|
206 |
> something that is going to be used as a class - that is, checked for via
|
|
207 |
> isinstance or other such idioms, them I would call it foo_class, so that
|
|
208 |
> it is clear that a callable is not sufficient. If it is only used as a
|
|
209 |
> factory, then yes, foo_factory is what I would use.
|
|
210 |
||
211 |
||
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
212 |
Lazy Imports
|
213 |
------------
|
|
214 |
||
215 |
To make startup time faster, we use the ``bzrlib.lazy_import`` module to
|
|
216 |
delay importing modules until they are actually used. ``lazy_import`` uses
|
|
217 |
the same syntax as regular python imports. So to import a few modules in a
|
|
218 |
lazy fashion do::
|
|
219 |
||
220 |
from bzrlib.lazy_import import lazy_import
|
|
221 |
lazy_import(globals(), """
|
|
222 |
import os
|
|
223 |
import subprocess
|
|
224 |
import sys
|
|
225 |
import time
|
|
226 |
||
227 |
from bzrlib import (
|
|
228 |
errors,
|
|
229 |
transport,
|
|
230 |
foo as bar,
|
|
231 |
)
|
|
232 |
import bzrlib.transport
|
|
233 |
import bzrlib.xml5
|
|
234 |
""")
|
|
235 |
||
236 |
At this point, all of these exist as a ``ImportReplacer`` object, ready to
|
|
237 |
be imported once a member is accessed.
|
|
238 |
||
239 |
||
240 |
Modules versus Members
|
|
241 |
~~~~~~~~~~~~~~~~~~~~~~
|
|
242 |
||
243 |
While it is possible for ``lazy_import()`` to import members of a module
|
|
244 |
wehn using the ``from module import member`` syntax, it is recommended to
|
|
245 |
only use that syntax to load sub modules ``from module import submodule``.
|
|
246 |
This is because variables and classes can frequently be used without
|
|
247 |
needing a sub-member for example::
|
|
248 |
||
249 |
lazy_import(globals(), """
|
|
250 |
from module import MyClass
|
|
251 |
""")
|
|
252 |
||
253 |
def test(x):
|
|
254 |
return isinstance(x, MyClass)
|
|
255 |
||
256 |
This will incorrectly fail, because ``MyClass`` is a ``ImportReplacer``
|
|
257 |
object, rather than the real class.
|
|
258 |
||
259 |
||
260 |
Passing to other variables
|
|
261 |
~~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
262 |
||
1996.1.26
by John Arbash Meinel
Update HACKING and docstrings |
263 |
It also is incorrect to assign ``ImportReplacer`` objects to other variables.
|
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
264 |
Because the replacer only knows about the original name, it is unable to
|
265 |
replace other variables. The ``ImportReplacer`` class will raise an
|
|
1996.1.26
by John Arbash Meinel
Update HACKING and docstrings |
266 |
``IllegalUseOfScopeReplacer`` exception if it can figure out that this
|
267 |
happened. But it requires accessing a member more than once from the new
|
|
268 |
variable, so some bugs are not detected right away.
|
|
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
269 |
|
270 |
||
1098
by Martin Pool
- notes on how output is written |
271 |
Writing output
|
272 |
==============
|
|
273 |
||
274 |
(The strategy described here is what we want to get to, but it's not |
|
275 |
consistently followed in the code at the moment.) |
|
276 |
||
277 |
bzrlib is intended to be a generically reusable library. It shouldn't |
|
278 |
write messages to stdout or stderr, because some programs that use it
|
|
279 |
might want to display that information through a GUI or some other
|
|
280 |
mechanism.
|
|
281 |
||
282 |
We can distinguish two types of output from the library:
|
|
283 |
||
284 |
1. Structured data representing the progress or result of an
|
|
285 |
operation. For example, for a commit command this will be a list
|
|
286 |
of the modified files and the finally committed revision number
|
|
287 |
and id.
|
|
288 |
||
289 |
These should be exposed either through the return code or by calls
|
|
290 |
to a callback parameter.
|
|
291 |
||
292 |
A special case of this is progress indicators for long-lived
|
|
293 |
operations, where the caller should pass a ProgressBar object.
|
|
294 |
||
295 |
2. Unstructured log/debug messages, mostly for the benefit of the
|
|
296 |
developers or users trying to debug problems. This should always
|
|
297 |
be sent through ``bzrlib.trace`` and Python ``logging``, so that
|
|
298 |
it can be redirected by the client.
|
|
299 |
||
300 |
The distinction between the two is a bit subjective, but in general if
|
|
301 |
there is any chance that a library would want to see something as
|
|
302 |
structured data, we should make it so.
|
|
303 |
||
304 |
The policy about how output is presented in the text-mode client
|
|
305 |
should be only in the command-line tool.
|
|
1092.1.22
by Robert Collins
update hacking with some test foo |
306 |
|
1418
by Robert Collins
merge martins latest |
307 |
|
1092.1.22
by Robert Collins
update hacking with some test foo |
308 |
Writing tests
|
309 |
=============
|
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
310 |
In general tests should be placed in a file named test_FOO.py where
|
1092.1.22
by Robert Collins
update hacking with some test foo |
311 |
FOO is the logical thing under test. That file should be placed in the
|
312 |
tests subdirectory under the package being tested.
|
|
313 |
||
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
314 |
For example, tests for merge3 in bzrlib belong in bzrlib/tests/test_merge3.py.
|
315 |
See bzrlib/selftest/test_sampler.py for a template test script.
|
|
316 |
||
317 |
Tests can be written for the UI or for individual areas of the library.
|
|
318 |
Choose whichever is appropriate: if adding a new command, or a new command
|
|
319 |
option, then you should be writing a UI test. If you are both adding UI
|
|
320 |
functionality and library functionality, you will want to write tests for
|
|
321 |
both the UI and the core behaviours. We call UI tests 'blackbox' tests |
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
322 |
and they are found in ``bzrlib/tests/blackbox/*.py``.
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
323 |
|
324 |
When writing blackbox tests please honour the following conventions:
|
|
325 |
||
326 |
1. Place the tests for the command 'name' in |
|
327 |
bzrlib/tests/blackbox/test_name.py. This makes it easy for developers
|
|
328 |
to locate the test script for a faulty command.
|
|
329 |
||
330 |
2. Use the 'self.run_bzr("name")' utility function to invoke the command |
|
331 |
rather than running bzr in a subprocess or invoking the
|
|
332 |
cmd_object.run() method directly. This is a lot faster than
|
|
333 |
subprocesses and generates the same logging output as running it in a
|
|
334 |
subprocess (which invoking the method directly does not).
|
|
335 |
|
|
336 |
3. Only test the one command in a single test script. Use the bzrlib
|
|
337 |
library when setting up tests and when evaluating the side-effects of
|
|
338 |
the command. We do this so that the library api has continual pressure
|
|
339 |
on it to be as functional as the command line in a simple manner, and
|
|
340 |
to isolate knock-on effects throughout the blackbox test suite when a
|
|
341 |
command changes it name or signature. Ideally only the tests for a
|
|
342 |
given command are affected when a given command is changed.
|
|
1393.1.61
by Martin Pool
doc |
343 |
|
1740.6.1
by Martin Pool
Remove Scratch objects used by doctests |
344 |
Doctests
|
345 |
--------
|
|
346 |
||
347 |
We make selective use of doctests__. In general they should provide
|
|
348 |
*examples* within the API documentation which can incidentally be tested. We
|
|
349 |
don't try to test every important case using doctests -- regular Python |
|
350 |
tests are generally a better solution. |
|
351 |
||
352 |
Most of these are in ``bzrlib/doc/api``. More additions are welcome. |
|
353 |
||
354 |
__ http://docs.python.org/lib/module-doctest.html |
|
355 |
||
356 |
||
1092.1.22
by Robert Collins
update hacking with some test foo |
357 |
Running tests |
358 |
=============
|
|
359 |
Currently, bzr selftest is used to invoke tests. |
|
360 |
You can provide a pattern argument to run a subset. For example, |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
361 |
to run just the blackbox tests, run:: |
1393.1.61
by Martin Pool
doc |
362 |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
363 |
./bzr selftest -v blackbox |
1393.1.61
by Martin Pool
doc |
364 |
|
1551.6.41
by Aaron Bentley
Add advice on skipping tests to HACKING |
365 |
To skip a particular test (or set of tests), you need to use a negative |
366 |
match, like so:: |
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
367 |
|
1551.6.41
by Aaron Bentley
Add advice on skipping tests to HACKING |
368 |
./bzr selftest '^(?!.*blackbox)' |
369 |
||
1393.1.61
by Martin Pool
doc |
370 |
|
371 |
Errors and exceptions |
|
372 |
=====================
|
|
373 |
||
1185.16.61
by mbp at sourcefrog
- start introducing hct error classes |
374 |
Errors are handled through Python exceptions. They can represent user |
375 |
errors, environmental errors or program bugs. Sometimes we can't be sure |
|
376 |
at the time it's raised which case applies. See bzrlib/errors.py for |
|
377 |
details on the error-handling practices. |
|
1092.1.22
by Robert Collins
update hacking with some test foo |
378 |
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
379 |
|
380 |
Jargon
|
|
381 |
======
|
|
382 |
||
383 |
revno
|
|
384 |
Integer identifier for a revision on the main line of a branch. |
|
385 |
Revision 0 is always the null revision; others are 1-based |
|
386 |
indexes into the branch's revision history. |
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
387 |
|
1185.33.98
by Martin Pool
Add notes on merge/review process. |
388 |
|
1684.1.3
by Martin Pool
(HACKING) some notes on handling unicode & urls for transports |
389 |
Transport
|
390 |
=========
|
|
391 |
||
392 |
The ``Transport`` layer handles access to local or remote directories.
|
|
393 |
Each Transport object acts like a logical connection to a particular
|
|
394 |
directory, and it allows various operations on files within it. You can
|
|
395 |
*clone* a transport to get a new Transport connected to a subdirectory or
|
|
396 |
parent directory.
|
|
397 |
||
398 |
Transports are not used for access to the working tree. At present
|
|
399 |
working trees are always local and they are accessed through the regular
|
|
400 |
Python file io mechanisms.
|
|
401 |
||
402 |
filenames vs URLs
|
|
403 |
-----------------
|
|
404 |
||
405 |
Transports work in URLs. Take note that URLs are by definition only
|
|
406 |
ASCII - the decision of how to encode a Unicode string into a URL must be
|
|
407 |
taken at a higher level, typically in the Store. (Note that Stores also
|
|
408 |
escape filenames which cannot be safely stored on all filesystems, but
|
|
409 |
this is a different level.)
|
|
410 |
||
411 |
The main reason for this is that it's not possible to safely roundtrip a |
|
412 |
URL into Unicode and then back into the same URL. The URL standard |
|
413 |
gives a way to represent non-ASCII bytes in ASCII (as %-escapes), but |
|
414 |
doesn't say how those bytes represent non-ASCII characters. (They're not |
|
415 |
guaranteed to be UTF-8 -- that is common but doesn't happen everywhere.) |
|
416 |
||
417 |
For example if the user enters the url ``http://example/%e0`` there's no |
|
418 |
way to tell whether that character represents "latin small letter a with
|
|
419 |
grave" in iso-8859-1, or "latin small letter r with acute" in iso-8859-2
|
|
420 |
or malformed UTF-8. So we can't convert their URL to Unicode reliably. |
|
421 |
||
422 |
Equally problematic if we're given a url-like string containing non-ascii |
|
423 |
characters (such as the accented a) we can't be sure how to convert that |
|
424 |
to the correct URL, because we don't know what encoding the server expects |
|
425 |
for those characters. (Although this is not totally reliable we might still
|
|
426 |
accept these and assume they should be put into UTF-8.)
|
|
427 |
||
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
428 |
A similar edge case is that the url ``http://foo/sweet%2Fsour`` contains
|
1684.1.3
by Martin Pool
(HACKING) some notes on handling unicode & urls for transports |
429 |
one directory component whose name is "sweet/sour". The escaped slash is
|
430 |
not a directory separator. If we try to convert URLs to regular Unicode
|
|
431 |
paths this information will be lost.
|
|
432 |
||
433 |
This implies that Transports must natively deal with URLs; for simplicity
|
|
434 |
they *only* deal with URLs and conversion of other strings to URLs is done
|
|
435 |
elsewhere. Information they return, such as from ``list_dir``, is also in
|
|
436 |
the form of URL components.
|
|
437 |
||
438 |
||
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
439 |
Unicode and Encoding Support
|
440 |
============================
|
|
441 |
||
442 |
This section discusses various techniques that Bazaar uses to handle
|
|
443 |
characters that are outside the ASCII set.
|
|
444 |
||
445 |
``Command.outf``
|
|
446 |
----------------
|
|
447 |
||
448 |
When a ``Command`` object is created, it is given a member variable
|
|
449 |
accessible by ``self.outf``. This is a file-like object, which is bound to
|
|
450 |
``sys.stdout``, and should be used to write information to the screen,
|
|
451 |
rather than directly writing to ``sys.stdout`` or calling ``print``.
|
|
452 |
This file has the ability to translate Unicode objects into the correct
|
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
453 |
representation, based on the console encoding. Also, the class attribute
|
454 |
``encoding_type`` will effect how unprintable characters will be
|
|
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
455 |
handled. This parameter can take one of 3 values:
|
456 |
||
457 |
replace
|
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
458 |
Unprintable characters will be represented with a suitable replacement
|
459 |
marker (typically '?'), and no exception will be raised. This is for |
|
460 |
any command which generates text for the user to review, rather than
|
|
461 |
for automated processing.
|
|
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
462 |
For example: ``bzr log`` should not fail if one of the entries has text
|
463 |
that cannot be displayed.
|
|
464 |
|
|
465 |
strict
|
|
466 |
Attempting to print and unprintable character will cause a UnicodeError.
|
|
467 |
This is for commands that are intended more as scripting support, rather
|
|
468 |
than plain user review.
|
|
469 |
For exampl: ``bzr ls`` is designed to be used with shell scripting. One
|
|
470 |
use would be ``bzr ls --null --unknows | xargs -0 rm``. If ``bzr``
|
|
471 |
printed a filename with a '?', the wrong file could be deleted. (At the |
|
472 |
very least, the correct file would not be deleted). An error is used to
|
|
473 |
indicate that the requested action could not be performed.
|
|
474 |
|
|
475 |
exact
|
|
476 |
Do not attempt to automatically convert Unicode strings. This is used
|
|
477 |
for commands that must handle conversion themselves.
|
|
478 |
For example: ``bzr diff`` needs to translate Unicode paths, but should
|
|
479 |
not change the exact text of the contents of the files.
|
|
480 |
||
481 |
||
482 |
``bzrlib.urlutils.unescape_for_display``
|
|
483 |
----------------------------------------
|
|
484 |
||
485 |
Because Transports work in URLs (as defined earlier), printing the raw URL
|
|
486 |
to the user is usually less than optimal. Characters outside the standard
|
|
487 |
set are printed as escapes, rather than the real character, and local
|
|
488 |
paths would be printed as ``file://`` urls. The function
|
|
489 |
``unescape_for_display`` attempts to unescape a URL, such that anything
|
|
490 |
that cannot be printed in the current encoding stays an escaped URL, but
|
|
491 |
valid characters are generated where possible.
|
|
492 |
||
493 |
||
1185.33.98
by Martin Pool
Add notes on merge/review process. |
494 |
Merge/review process
|
495 |
====================
|
|
496 |
||
497 |
If you'd like to propose a change, please post to the |
|
498 |
bazaar-ng@lists.canonical.com list with a patch, bzr changeset, or link to a |
|
499 |
branch. Please put '[patch]' in the subject so we can pick them out, and |
|
500 |
include some text explaining the change. Remember to put an update to the NEWS |
|
501 |
file in your diff, if it makes any changes visible to users or plugin |
|
502 |
developers. Please include a diff against mainline if you're giving a link to |
|
503 |
a branch.
|
|
504 |
||
505 |
Please indicate if you think the code is ready to merge, or if it's just a |
|
506 |
draft or for discussion. If you want comments from many developers rather than |
|
507 |
to be merged, you can put '[rfc]' in the subject lines. |
|
508 |
||
509 |
Anyone is welcome to review code. There are broadly three gates for |
|
510 |
code to get in: |
|
511 |
||
512 |
* Doesn't reduce test coverage: if it adds new methods or commands, |
|
513 |
there should be tests for them. There is a good test framework
|
|
514 |
and plenty of examples to crib from, but if you are having trouble
|
|
515 |
working out how to test something feel free to post a draft patch
|
|
516 |
and ask for help.
|
|
517 |
||
518 |
* Doesn't reduce design clarity, such as by entangling objects |
|
519 |
we're trying to separate. This is mostly something the more |
|
520 |
experienced reviewers need to help check.
|
|
521 |
||
522 |
* Improves bugs, features, speed, or code simplicity.
|
|
523 |
||
524 |
Code that goes in should pass all three.
|
|
525 |
||
526 |
If you read a patch please reply and say so. We can use a numeric scale
|
|
527 |
of -1, -0, +0, +1, meaning respectively "really don't want it in current |
|
528 |
form", "somewhat uncomfortable", "ok with me", and "please put it in". |
|
529 |
Anyone can "vote". (It's not really voting, just a terse expression.) |
|
530 |
||
531 |
If something gets say two +1 votes from core reviewers, and no
|
|
532 |
vetos, then it's OK to come in. Any of the core developers can bring it
|
|
533 |
into their integration branch, which I'll merge regularly. (If you do
|
|
534 |
so, please reply and say so.)
|
|
535 |
||
536 |
||
1861.2.19
by Alexander Belchenko
HACKING: mention where to get instructions for building windows installers |
537 |
Making installers for OS Windows
|
538 |
================================
|
|
1861.2.20
by Alexander Belchenko
English |
539 |
To build a win32 installer, see the instructions on the wiki page:
|
1861.2.19
by Alexander Belchenko
HACKING: mention where to get instructions for building windows installers |
540 |
http://bazaar-vcs.org/BzrWin32Installer
|
541 |
||
542 |
||
1740.6.1
by Martin Pool
Remove Scratch objects used by doctests |
543 |
:: vim: ft=rst tw=74 ai
|