974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
1 |
============================
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
2 |
Guidelines for modifying bzr |
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
3 |
============================
|
4 |
||
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
5 |
.. contents:: |
6 |
||
7 |
(The current version of this document is available in the file ``HACKING`` |
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
8 |
in the source tree, or at http://doc.bazaar-vcs.org/current/hacking.html) |
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
9 |
|
10 |
Overall
|
|
11 |
=======
|
|
12 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
13 |
* New functionality should have test cases. Preferably write the |
14 |
test before writing the code. |
|
15 |
||
16 |
In general, you can test at either the command-line level or the |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
17 |
internal API level. See Writing Tests below for more detail. |
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
18 |
|
1185.33.48
by Martin Pool
Hacking notes on TDD |
19 |
* Try to practice Test-Driven Development. before fixing a bug, write a |
20 |
test case so that it does not regress. Similarly for adding a new |
|
21 |
feature: write a test case for a small version of the new feature before |
|
22 |
starting on the code itself. Check the test fails on the old code, then |
|
23 |
add the feature or fix and check it passes. |
|
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
24 |
|
25 |
* Exceptions should be defined inside bzrlib.errors, so that we can |
|
26 |
see the whole tree at a glance. |
|
27 |
||
28 |
* Imports should be done at the top-level of the file, unless there is |
|
29 |
a strong reason to have them lazily loaded when a particular |
|
30 |
function runs. Import statements have a cost, so try to make sure |
|
31 |
they don't run inside hot functions. |
|
32 |
||
33 |
* Module names should always be given fully-qualified,
|
|
34 |
i.e. ``bzrlib.hashcache`` not just ``hashcache``.
|
|
35 |
||
1185.33.48
by Martin Pool
Hacking notes on TDD |
36 |
* Commands should return non-zero when they encounter circumstances that
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
37 |
the user should really pay attention to - which includes trivial shell
|
38 |
pipelines.
|
|
39 |
||
1185.34.1
by Jelmer Vernooij
Fix a couple of typo's |
40 |
Recommended values are
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
41 |
0. OK,
|
42 |
1. Conflicts in merge-like operations, or changes are present in
|
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
43 |
diff-like operations.
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
44 |
2. Unrepresentable diff changes (i.e. binary files that we cannot show
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
45 |
a diff of).
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
46 |
3. An error or exception has occurred.
|
1476
by Robert Collins
Merge now has a retcode of 1 when conflicts occur. (Robert Collins) |
47 |
|
1393.1.54
by Martin Pool
- more hacking notes on evolving interfaces |
48 |
Evolving interfaces
|
49 |
-------------------
|
|
50 |
||
1534.2.4
by Robert Collins
Update NEWS and HACKING for the symbol_versioning module. |
51 |
We have a commitment to 6 months API stability - any supported symbol in a
|
52 |
release of bzr MUST NOT be altered in any way that would result in
|
|
53 |
breaking existing code that uses it. That means that method names,
|
|
54 |
parameter ordering, parameter names, variable and attribute names etc must
|
|
55 |
not be changed without leaving a 'deprecated forwarder' behind. This even |
|
56 |
applies to modules and classes.
|
|
57 |
||
58 |
If you wish to change the behaviour of a supported API in an incompatible
|
|
2063.3.1
by wang
fix typos |
59 |
way, you need to change its name as well. For instance, if I add an optional keyword
|
1534.2.4
by Robert Collins
Update NEWS and HACKING for the symbol_versioning module. |
60 |
parameter to branch.commit - that's fine. On the other hand, if I add a |
61 |
keyword parameter to branch.commit which is a *required* transaction |
|
62 |
object, I should rename the API - i.e. to 'branch.commit_transaction'. |
|
63 |
||
64 |
When renaming such supported API's, be sure to leave a deprecated_method (or |
|
65 |
_function or ...) behind which forwards to the new API. See the
|
|
66 |
bzrlib.symbol_versioning module for decorators that take care of the
|
|
67 |
details for you - such as updating the docstring, and issuing a warning
|
|
68 |
when the old api is used.
|
|
69 |
||
2063.3.1
by wang
fix typos |
70 |
For unsupported API's, it does not hurt to follow this discipline, but it's |
1534.2.4
by Robert Collins
Update NEWS and HACKING for the symbol_versioning module. |
71 |
not required. Minimally though, please try to rename things so that
|
72 |
callers will at least get an AttributeError rather than weird results.
|
|
73 |
||
1393.1.54
by Martin Pool
- more hacking notes on evolving interfaces |
74 |
|
1534.3.1
by Robert Collins
* bzrlib.osutils.safe_unicode now exists to provide parameter coercion |
75 |
Standard parameter types
|
76 |
------------------------
|
|
77 |
||
78 |
There are some common requirements in the library: some parameters need to be
|
|
79 |
unicode safe, some need byte strings, and so on. At the moment we have
|
|
80 |
only codified one specific pattern: Parameters that need to be unicode
|
|
2052.3.9
by John Arbash Meinel
Add an entry about copyright to HACKING |
81 |
should be checked via ``bzrlib.osutils.safe_unicode``. This will coerce the
|
1534.3.1
by Robert Collins
* bzrlib.osutils.safe_unicode now exists to provide parameter coercion |
82 |
input into unicode in a consistent fashion, allowing trivial strings to be
|
83 |
used for programmer convenience, but not performing unpredictably in the
|
|
84 |
presence of different locales.
|
|
85 |
||
2052.3.9
by John Arbash Meinel
Add an entry about copyright to HACKING |
86 |
|
87 |
Copyright
|
|
88 |
---------
|
|
89 |
||
90 |
The copyright policy for bzr was recently made clear in this email (edited
|
|
91 |
for grammatical correctness)::
|
|
92 |
||
93 |
The attached patch cleans up the copyright and license statements in
|
|
94 |
the bzr source. It also adds tests to help us remember to add them
|
|
95 |
with the correct text.
|
|
96 |
||
97 |
We had the problem that lots of our files were "Copyright Canonical
|
|
98 |
Development Ltd" which is not a real company, and some other variations
|
|
99 |
on this theme. Also, some files were missing the GPL statements.
|
|
100 |
|
|
101 |
I want to be clear about the intent of this patch, since copyright can
|
|
102 |
be a little controversial.
|
|
103 |
|
|
104 |
1) The big motivation for this is not to shut out the community, but
|
|
105 |
just to clean up all of the invalid copyright statements.
|
|
106 |
|
|
107 |
2) It has been the general policy for bzr that we want a single
|
|
108 |
copyright holder for all of the core code. This is following the model
|
|
109 |
set by the FSF, which makes it easier to update the code to a new
|
|
110 |
license in case problems are encountered. (For example, if we want to
|
|
111 |
upgrade the project universally to GPL v3 it is much simpler if there is
|
|
112 |
a single copyright holder). It also makes it clearer if copyright is
|
|
113 |
ever debated, there is a single holder, which makes it easier to defend
|
|
114 |
in court, etc. (I think the FSF position is that if you assign them
|
|
115 |
copyright, they can defend it in court rather than you needing to, and
|
|
116 |
I'm sure Canonical would do the same). |
|
117 |
As such, Canonical has requested copyright assignments from all of the |
|
118 |
major contributers. |
|
119 |
||
120 |
3) If someone wants to add code and not attribute it to Canonical, there |
|
121 |
is a specific list of files that are excluded from this check. And the |
|
122 |
test failure indicates where that is, and how to update it. |
|
123 |
||
124 |
4) If anyone feels that I changed a copyright statement incorrectly, just |
|
125 |
let me know, and I'll be happy to correct it. Whenever you have large |
|
126 |
mechanical changes like this, it is possible to make some mistakes.
|
|
127 |
|
|
128 |
Just to reiterate, this is a community project, and it is meant to stay
|
|
129 |
that way. Core bzr code is copyright Canonical for legal reasons, and
|
|
130 |
the tests are just there to help us maintain that.
|
|
131 |
||
132 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
133 |
Documentation
|
134 |
=============
|
|
135 |
||
2425.2.1
by Robert Collins
Command objects can now declare related help topics by having _see_also |
136 |
When you change bzrlib, please update the relevant documentation for the
|
137 |
change you made: Changes to commands should update their help, and
|
|
138 |
possibly end user tutorials; changes to the core library should be
|
|
139 |
reflected in API documentation.
|
|
140 |
||
141 |
Commands
|
|
142 |
--------
|
|
143 |
||
144 |
The docstring of a command is used by ``bzr help`` to generate help output
|
|
145 |
for the command. The list 'takes_options' attribute on a command is used by |
|
146 |
``bzr help`` to document the options for the command - the command
|
|
147 |
docstring does not need to document them. Finally, the '_see_also' |
|
148 |
attribute on a command can be used to reference other related help topics.
|
|
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
149 |
|
1185.33.2
by Martin Pool
How to maintain the NEWS file |
150 |
NEWS file
|
151 |
---------
|
|
152 |
||
974.1.26
by aaron.bentley at utoronto
merged mbp@sourcefrog.net-20050817233101-0939da1cf91f2472 |
153 |
If you make a user-visible change, please add a note to the NEWS file.
|
154 |
The description should be written to make sense to someone who's just |
|
155 |
a user of bzr, not a developer: new functions or classes shouldn't be |
|
156 |
mentioned, but new commands, changes in behaviour or fixed nontrivial
|
|
157 |
bugs should be listed. See the existing entries for an idea of what
|
|
158 |
should be done.
|
|
1098
by Martin Pool
- notes on how output is written |
159 |
|
1185.33.2
by Martin Pool
How to maintain the NEWS file |
160 |
Within each release, entries in the news file should have the most
|
161 |
user-visible changes first. So the order should be approximately:
|
|
162 |
||
163 |
* changes to existing behaviour - the highest priority because the
|
|
164 |
user's existing knowledge is incorrect |
|
165 |
* new features - should be brought to their attention |
|
166 |
* bug fixes - may be of interest if the bug was affecting them, and |
|
167 |
should include the bug number if any |
|
168 |
* major documentation changes |
|
169 |
* changes to internal interfaces |
|
170 |
||
171 |
People who made significant contributions to each change are listed in |
|
172 |
parenthesis. This can include reporting bugs (particularly with good |
|
173 |
details or reproduction recipes), submitting patches, etc. |
|
1098
by Martin Pool
- notes on how output is written |
174 |
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
175 |
API documentation |
176 |
-----------------
|
|
177 |
||
178 |
Functions, methods, classes and modules should have docstrings |
|
179 |
describing how they are used. |
|
180 |
||
181 |
The first line of the docstring should be a self-contained sentence. |
|
182 |
||
183 |
For the special case of Command classes, this acts as the user-visible |
|
184 |
documentation shown by the help command. |
|
185 |
||
186 |
The docstrings should be formatted as reStructuredText_ (like this |
|
187 |
document), suitable for processing using the epydoc_ tool into HTML |
|
188 |
documentation. |
|
189 |
||
190 |
.. _reStructuredText: http://docutils.sourceforge.net/rst.html |
|
191 |
.. _epydoc: http://epydoc.sourceforge.net/ |
|
192 |
||
193 |
||
194 |
||
195 |
Coding style |
|
196 |
============
|
|
197 |
||
198 |
Please write PEP-8__ compliant code. |
|
199 |
||
200 |
One often-missed requirement is that the first line of docstrings |
|
201 |
should be a self-contained one-sentence summary. |
|
202 |
||
203 |
__ http://www.python.org/peps/pep-0008.html |
|
204 |
||
205 |
||
206 |
||
207 |
Naming
|
|
208 |
------
|
|
209 |
||
210 |
Functions, methods or members that are in some sense "private" are given |
|
211 |
a leading underscore prefix. This is just a hint that code outside the |
|
212 |
implementation should probably not use that interface. |
|
213 |
||
214 |
We prefer class names to be concatenated capital words (``TestCase``) |
|
215 |
and variables, methods and functions to be lowercase words joined by |
|
216 |
underscores (``revision_id``, ``get_revision``). |
|
217 |
||
218 |
For the purposes of naming some names are treated as single compound |
|
219 |
words: "filename", "revno". |
|
220 |
||
221 |
Consider naming classes as nouns and functions/methods as verbs. |
|
222 |
||
2221.4.7
by Aaron Bentley
Add suggestion to HACKING |
223 |
Try to avoid using abbreviations in names, because there can be |
224 |
inconsistency if other people use the full name. |
|
225 |
||
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
226 |
|
227 |
Standard names |
|
228 |
--------------
|
|
229 |
||
230 |
``revision_id`` not ``rev_id`` or ``revid`` |
|
231 |
||
232 |
Functions that transform one thing to another should be named ``x_to_y`` |
|
233 |
(not ``x2y`` as occurs in some old code.) |
|
234 |
||
1098
by Martin Pool
- notes on how output is written |
235 |
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
236 |
Destructors
|
237 |
-----------
|
|
238 |
||
1185.16.150
by Martin Pool
Improved description of python exception policies |
239 |
Python destructors (``__del__``) work differently to those of other |
240 |
languages. In particular, bear in mind that destructors may be called |
|
241 |
immediately when the object apparently becomes unreferenced, or at some |
|
242 |
later time, or possibly never at all. Therefore we have restrictions on |
|
243 |
what can be done inside them. |
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
244 |
|
245 |
0. Never use a __del__ method without asking Martin/Robert first. |
|
246 |
||
247 |
1. Never rely on a ``__del__`` method running. If there is code that |
|
248 |
must run, do it from a ``finally`` block instead. |
|
249 |
||
250 |
2. Never ``import`` from inside a ``__del__`` method, or you may crash the |
|
251 |
interpreter!! |
|
252 |
||
253 |
3. In some places we raise a warning from the destructor if the object |
|
254 |
has not been cleaned up or closed. This is considered OK: the warning |
|
255 |
may not catch every case but it's still useful sometimes. |
|
256 |
||
257 |
||
1740.2.5
by Aaron Bentley
Merge from bzr.dev |
258 |
Factories
|
259 |
---------
|
|
260 |
||
261 |
In some places we have variables which point to callables that construct
|
|
262 |
new instances. That is to say, they can be used a lot like class objects,
|
|
263 |
but they shouldn't be *named* like classes: |
|
264 |
||
265 |
> I think that things named FooBar should create instances of FooBar when |
|
266 |
> called. Its plain confusing for them to do otherwise. When we have |
|
267 |
> something that is going to be used as a class - that is, checked for via |
|
268 |
> isinstance or other such idioms, them I would call it foo_class, so that |
|
269 |
> it is clear that a callable is not sufficient. If it is only used as a |
|
270 |
> factory, then yes, foo_factory is what I would use. |
|
271 |
||
272 |
||
1911.4.15
by John Arbash Meinel
Updated HACKING and docstrings per Martin's suggestions |
273 |
Registries
|
274 |
----------
|
|
275 |
||
276 |
Several places in Bazaar use (or will use) a registry, which is a |
|
277 |
mapping from names to objects or classes. The registry allows for |
|
278 |
loading in registered code only when it's needed, and keeping |
|
279 |
associated information such as a help string or description.
|
|
280 |
||
281 |
||
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
282 |
Lazy Imports
|
283 |
------------
|
|
284 |
||
285 |
To make startup time faster, we use the ``bzrlib.lazy_import`` module to
|
|
286 |
delay importing modules until they are actually used. ``lazy_import`` uses
|
|
287 |
the same syntax as regular python imports. So to import a few modules in a
|
|
288 |
lazy fashion do::
|
|
289 |
||
290 |
from bzrlib.lazy_import import lazy_import
|
|
291 |
lazy_import(globals(), """
|
|
292 |
import os
|
|
293 |
import subprocess
|
|
294 |
import sys
|
|
295 |
import time
|
|
296 |
||
297 |
from bzrlib import (
|
|
298 |
errors,
|
|
299 |
transport,
|
|
1996.3.37
by John Arbash Meinel
Update HACKING and TODO |
300 |
revision as _mod_revision,
|
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
301 |
)
|
302 |
import bzrlib.transport
|
|
303 |
import bzrlib.xml5
|
|
304 |
""")
|
|
305 |
||
306 |
At this point, all of these exist as a ``ImportReplacer`` object, ready to
|
|
1996.3.37
by John Arbash Meinel
Update HACKING and TODO |
307 |
be imported once a member is accessed. Also, when importing a module into
|
308 |
the local namespace, which is likely to clash with variable names, it is
|
|
2370.1.1
by Ian Clatworthy
Minor corrections to HACKING |
309 |
recommended to prefix it as ``_mod_<module>``. This makes it clearer that
|
1996.3.37
by John Arbash Meinel
Update HACKING and TODO |
310 |
the variable is a module, and these object should be hidden anyway, since
|
311 |
they shouldn't be imported into other namespaces. |
|
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
312 |
|
313 |
||
314 |
Modules versus Members |
|
315 |
~~~~~~~~~~~~~~~~~~~~~~
|
|
316 |
||
317 |
While it is possible for ``lazy_import()`` to import members of a module |
|
2063.3.1
by wang
fix typos |
318 |
when using the ``from module import member`` syntax, it is recommended to |
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
319 |
only use that syntax to load sub modules ``from module import submodule``. |
320 |
This is because variables and classes can frequently be used without |
|
321 |
needing a sub-member for example:: |
|
322 |
||
323 |
lazy_import(globals(), """ |
|
324 |
from module import MyClass
|
|
325 |
""") |
|
326 |
||
327 |
def test(x): |
|
328 |
return isinstance(x, MyClass) |
|
329 |
||
330 |
This will incorrectly fail, because ``MyClass`` is a ``ImportReplacer`` |
|
331 |
object, rather than the real class. |
|
332 |
||
333 |
||
334 |
Passing to other variables |
|
335 |
~~~~~~~~~~~~~~~~~~~~~~~~~~
|
|
336 |
||
1996.1.26
by John Arbash Meinel
Update HACKING and docstrings |
337 |
It also is incorrect to assign ``ImportReplacer`` objects to other variables. |
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
338 |
Because the replacer only knows about the original name, it is unable to |
339 |
replace other variables. The ``ImportReplacer`` class will raise an |
|
1996.1.26
by John Arbash Meinel
Update HACKING and docstrings |
340 |
``IllegalUseOfScopeReplacer`` exception if it can figure out that this |
341 |
happened. But it requires accessing a member more than once from the new |
|
342 |
variable, so some bugs are not detected right away. |
|
1996.1.20
by John Arbash Meinel
HACKING and NEWS |
343 |
|
344 |
||
1098
by Martin Pool
- notes on how output is written |
345 |
Writing output |
346 |
==============
|
|
347 |
||
348 |
(The strategy described here is what we want to get to, but it's not |
|
349 |
consistently followed in the code at the moment.)
|
|
350 |
||
351 |
bzrlib is intended to be a generically reusable library. It shouldn't |
|
352 |
write messages to stdout or stderr, because some programs that use it |
|
353 |
might want to display that information through a GUI or some other |
|
354 |
mechanism. |
|
355 |
||
356 |
We can distinguish two types of output from the library: |
|
357 |
||
358 |
1. Structured data representing the progress or result of an |
|
359 |
operation. For example, for a commit command this will be a list |
|
360 |
of the modified files and the finally committed revision number |
|
361 |
and id. |
|
362 |
||
363 |
These should be exposed either through the return code or by calls |
|
364 |
to a callback parameter. |
|
365 |
||
366 |
A special case of this is progress indicators for long-lived |
|
367 |
operations, where the caller should pass a ProgressBar object. |
|
368 |
||
369 |
2. Unstructured log/debug messages, mostly for the benefit of the |
|
370 |
developers or users trying to debug problems. This should always |
|
371 |
be sent through ``bzrlib.trace`` and Python ``logging``, so that |
|
372 |
it can be redirected by the client. |
|
373 |
||
374 |
The distinction between the two is a bit subjective, but in general if |
|
375 |
there is any chance that a library would want to see something as |
|
376 |
structured data, we should make it so. |
|
377 |
||
378 |
The policy about how output is presented in the text-mode client |
|
379 |
should be only in the command-line tool. |
|
1092.1.22
by Robert Collins
update hacking with some test foo |
380 |
|
1418
by Robert Collins
merge martins latest |
381 |
|
1092.1.22
by Robert Collins
update hacking with some test foo |
382 |
Writing tests |
383 |
=============
|
|
2067.2.2
by John Arbash Meinel
Review comments from Robert |
384 |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
385 |
In general tests should be placed in a file named test_FOO.py where |
1092.1.22
by Robert Collins
update hacking with some test foo |
386 |
FOO is the logical thing under test. That file should be placed in the |
387 |
tests subdirectory under the package being tested. |
|
388 |
||
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
389 |
For example, tests for merge3 in bzrlib belong in bzrlib/tests/test_merge3.py. |
2370.1.1
by Ian Clatworthy
Minor corrections to HACKING |
390 |
See bzrlib/tests/test_sampler.py for a template test script. |
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
391 |
|
392 |
Tests can be written for the UI or for individual areas of the library. |
|
393 |
Choose whichever is appropriate: if adding a new command, or a new command |
|
394 |
option, then you should be writing a UI test. If you are both adding UI |
|
395 |
functionality and library functionality, you will want to write tests for |
|
396 |
both the UI and the core behaviours. We call UI tests 'blackbox' tests |
|
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
397 |
and they are found in ``bzrlib/tests/blackbox/*.py``. |
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
398 |
|
399 |
When writing blackbox tests please honour the following conventions:
|
|
400 |
||
401 |
1. Place the tests for the command 'name' in
|
|
402 |
bzrlib/tests/blackbox/test_name.py. This makes it easy for developers
|
|
403 |
to locate the test script for a faulty command.
|
|
404 |
||
405 |
2. Use the 'self.run_bzr("name")' utility function to invoke the command
|
|
406 |
rather than running bzr in a subprocess or invoking the
|
|
407 |
cmd_object.run() method directly. This is a lot faster than
|
|
408 |
subprocesses and generates the same logging output as running it in a
|
|
409 |
subprocess (which invoking the method directly does not).
|
|
410 |
|
|
411 |
3. Only test the one command in a single test script. Use the bzrlib
|
|
412 |
library when setting up tests and when evaluating the side-effects of
|
|
413 |
the command. We do this so that the library api has continual pressure
|
|
414 |
on it to be as functional as the command line in a simple manner, and
|
|
415 |
to isolate knock-on effects throughout the blackbox test suite when a
|
|
2063.3.1
by wang
fix typos |
416 |
command changes its name or signature. Ideally only the tests for a
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
417 |
given command are affected when a given command is changed.
|
1393.1.61
by Martin Pool
doc |
418 |
|
2067.2.2
by John Arbash Meinel
Review comments from Robert |
419 |
4. If you have a test which does actually require running bzr in a
|
420 |
subprocess you can use ``run_bzr_subprocess``. By default the spawned
|
|
421 |
process will not load plugins unless ``--allow-plugins`` is supplied.
|
|
422 |
||
423 |
||
1740.6.1
by Martin Pool
Remove Scratch objects used by doctests |
424 |
Doctests
|
425 |
--------
|
|
426 |
||
427 |
We make selective use of doctests__. In general they should provide
|
|
428 |
*examples* within the API documentation which can incidentally be tested. We
|
|
429 |
don't try to test every important case using doctests -- regular Python
|
|
430 |
tests are generally a better solution.
|
|
431 |
||
432 |
Most of these are in ``bzrlib/doc/api``. More additions are welcome.
|
|
433 |
||
434 |
__ http://docs.python.org/lib/module-doctest.html
|
|
435 |
||
436 |
||
1092.1.22
by Robert Collins
update hacking with some test foo |
437 |
Running tests
|
438 |
=============
|
|
439 |
Currently, bzr selftest is used to invoke tests.
|
|
440 |
You can provide a pattern argument to run a subset. For example,
|
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
441 |
to run just the blackbox tests, run::
|
1393.1.61
by Martin Pool
doc |
442 |
|
1638.1.1
by Robert Collins
Update HACKING to reflect current test writing policy. |
443 |
./bzr selftest -v blackbox
|
1393.1.61
by Martin Pool
doc |
444 |
|
2394.2.6
by Ian Clatworthy
completed blackbox tests |
445 |
To skip a particular test (or set of tests), use the --exclude option
|
446 |
(shorthand -x) like so::
|
|
447 |
||
448 |
./bzr selftest -v -x blackbox
|
|
449 |
||
450 |
To list tests without running them, use the --list-only option like so::
|
|
451 |
||
452 |
./bzr selftest --list-only
|
|
453 |
||
454 |
This option can be combined with other selftest options (like -x) and
|
|
455 |
filter patterns to understand their effect.
|
|
1551.6.41
by Aaron Bentley
Add advice on skipping tests to HACKING |
456 |
|
1393.1.61
by Martin Pool
doc |
457 |
|
458 |
Errors and exceptions
|
|
459 |
=====================
|
|
460 |
||
2067.3.1
by Martin Pool
Clean up BzrNewError, other exception classes and users. |
461 |
Errors are handled through Python exceptions.
|
462 |
||
2067.3.5
by Martin Pool
Review comments |
463 |
We broadly classify errors as either being either internal or not,
|
464 |
depending on whether ``user_error`` is set or not. If we think it's our
|
|
465 |
fault, we show a backtrace, an invitation to report the bug, and possibly
|
|
466 |
other details. This is the default for errors that aren't specifically
|
|
467 |
recognized as being caused by a user error. Otherwise we show a briefer
|
|
468 |
message, unless -Derror was given.
|
|
2067.3.1
by Martin Pool
Clean up BzrNewError, other exception classes and users. |
469 |
|
470 |
Many errors originate as "environmental errors" which are raised by Python
|
|
471 |
or builtin libraries -- for example IOError. These are treated as being
|
|
472 |
our fault, unless they're caught in a particular tight scope where we know
|
|
473 |
that they indicate a user errors. For example if the repository format
|
|
474 |
is not found, the user probably gave the wrong path or URL. But if one of
|
|
475 |
the files inside the repository is not found, then it's our fault --
|
|
476 |
either there's a bug in bzr, or something complicated has gone wrong in
|
|
477 |
the environment that means one internal file was deleted.
|
|
478 |
||
479 |
Many errors are defined in ``bzrlib/errors.py`` but it's OK for new errors
|
|
480 |
to be added near the place where they are used.
|
|
481 |
||
482 |
Exceptions are formatted for the user by conversion to a string
|
|
483 |
(eventually calling their ``__str__`` method.) As a convenience the
|
|
484 |
``._fmt`` member can be used as a template which will be mapped to the
|
|
485 |
error's instance dict.
|
|
486 |
||
487 |
New exception classes should be defined when callers might want to catch
|
|
488 |
that exception specifically, or when it needs a substantially different
|
|
489 |
format string.
|
|
490 |
||
491 |
Exception strings should start with a capital letter and should not have a
|
|
2067.3.5
by Martin Pool
Review comments |
492 |
final fullstop. If long, they may contain newlines to break the text.
|
2067.3.1
by Martin Pool
Clean up BzrNewError, other exception classes and users. |
493 |
|
1092.1.22
by Robert Collins
update hacking with some test foo |
494 |
|
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
495 |
|
2423.3.2
by Martin Pool
doc BZR_PDB and sigquit |
496 |
Debugging
|
497 |
=========
|
|
498 |
||
499 |
Bazaar has a few facilities to help debug problems by going into pdb_, the
|
|
500 |
Python debugger.
|
|
501 |
||
2423.3.7
by Martin Pool
Add BZR_SIGQUIT_PDB=0 option to disable breakin. |
502 |
.. _pdb: http://docs.python.org/lib/debugger-commands.html
|
2423.3.2
by Martin Pool
doc BZR_PDB and sigquit |
503 |
|
504 |
If the ``BZR_PDB`` environment variable is set
|
|
505 |
then bzr will go into pdb post-mortem mode when an unhandled exception
|
|
506 |
occurs.
|
|
507 |
||
508 |
If you send a SIGQUIT signal to bzr, which can be done by pressing C-\ on Unix,
|
|
509 |
bzr will go into the debugger immediately. You can continue execution by
|
|
2423.3.7
by Martin Pool
Add BZR_SIGQUIT_PDB=0 option to disable breakin. |
510 |
typing ``c``. This can be disabled if necessary by setting the
|
511 |
environment variable ``BZR_SIGQUIT_PDB=0``.
|
|
2423.3.2
by Martin Pool
doc BZR_PDB and sigquit |
512 |
|
513 |
||
514 |
||
1393.1.53
by Martin Pool
- notes from coding-convention discussion |
515 |
Jargon
|
516 |
======
|
|
517 |
||
518 |
revno
|
|
519 |
Integer identifier for a revision on the main line of a branch.
|
|
520 |
Revision 0 is always the null revision; others are 1-based
|
|
521 |
indexes into the branch's revision history.
|
|
1185.16.85
by mbp at sourcefrog
- rules for using destructors |
522 |
|
1185.33.98
by Martin Pool
Add notes on merge/review process. |
523 |
|
1684.1.3
by Martin Pool
(HACKING) some notes on handling unicode & urls for transports |
524 |
Transport
|
525 |
=========
|
|
526 |
||
527 |
The ``Transport`` layer handles access to local or remote directories.
|
|
528 |
Each Transport object acts like a logical connection to a particular
|
|
529 |
directory, and it allows various operations on files within it. You can
|
|
530 |
*clone* a transport to get a new Transport connected to a subdirectory or
|
|
531 |
parent directory.
|
|
532 |
||
533 |
Transports are not used for access to the working tree. At present
|
|
534 |
working trees are always local and they are accessed through the regular
|
|
535 |
Python file io mechanisms.
|
|
536 |
||
537 |
filenames vs URLs
|
|
538 |
-----------------
|
|
539 |
||
540 |
Transports work in URLs. Take note that URLs are by definition only
|
|
541 |
ASCII - the decision of how to encode a Unicode string into a URL must be
|
|
542 |
taken at a higher level, typically in the Store. (Note that Stores also
|
|
543 |
escape filenames which cannot be safely stored on all filesystems, but
|
|
544 |
this is a different level.)
|
|
545 |
||
546 |
The main reason for this is that it's not possible to safely roundtrip a
|
|
547 |
URL into Unicode and then back into the same URL. The URL standard
|
|
548 |
gives a way to represent non-ASCII bytes in ASCII (as %-escapes), but
|
|
549 |
doesn't say how those bytes represent non-ASCII characters. (They're not
|
|
550 |
guaranteed to be UTF-8 -- that is common but doesn't happen everywhere.)
|
|
551 |
||
552 |
For example if the user enters the url ``http://example/%e0`` there's no
|
|
553 |
way to tell whether that character represents "latin small letter a with
|
|
554 |
grave" in iso-8859-1, or "latin small letter r with acute" in iso-8859-2
|
|
555 |
or malformed UTF-8. So we can't convert their URL to Unicode reliably.
|
|
556 |
||
557 |
Equally problematic if we're given a url-like string containing non-ascii
|
|
558 |
characters (such as the accented a) we can't be sure how to convert that
|
|
559 |
to the correct URL, because we don't know what encoding the server expects
|
|
560 |
for those characters. (Although this is not totally reliable we might still
|
|
561 |
accept these and assume they should be put into UTF-8.)
|
|
562 |
||
1711.2.94
by John Arbash Meinel
Update HACKING to be rst compliant |
563 |
A similar edge case is that the url ``http://foo/sweet%2Fsour`` contains
|
1684.1.3
by Martin Pool
(HACKING) some notes on handling unicode & urls for transports |
564 |
one directory component whose name is "sweet/sour". The escaped slash is
|
565 |
not a directory separator. If we try to convert URLs to regular Unicode
|
|
566 |
paths this information will be lost.
|
|
567 |
||
568 |
This implies that Transports must natively deal with URLs; for simplicity
|
|
569 |
they *only* deal with URLs and conversion of other strings to URLs is done
|
|
570 |
elsewhere. Information they return, such as from ``list_dir``, is also in
|
|
571 |
the form of URL components.
|
|
572 |
||
573 |
||
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
574 |
Unicode and Encoding Support
|
575 |
============================
|
|
576 |
||
577 |
This section discusses various techniques that Bazaar uses to handle
|
|
578 |
characters that are outside the ASCII set.
|
|
579 |
||
580 |
``Command.outf``
|
|
581 |
----------------
|
|
582 |
||
583 |
When a ``Command`` object is created, it is given a member variable
|
|
584 |
accessible by ``self.outf``. This is a file-like object, which is bound to
|
|
585 |
``sys.stdout``, and should be used to write information to the screen,
|
|
586 |
rather than directly writing to ``sys.stdout`` or calling ``print``.
|
|
587 |
This file has the ability to translate Unicode objects into the correct
|
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
588 |
representation, based on the console encoding. Also, the class attribute
|
589 |
``encoding_type`` will effect how unprintable characters will be
|
|
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
590 |
handled. This parameter can take one of 3 values:
|
591 |
||
592 |
replace
|
|
1711.2.96
by John Arbash Meinel
cleanup from suggestions by Robert and Martin |
593 |
Unprintable characters will be represented with a suitable replacement
|
594 |
marker (typically '?'), and no exception will be raised. This is for
|
|
595 |
any command which generates text for the user to review, rather than
|
|
596 |
for automated processing.
|
|
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
597 |
For example: ``bzr log`` should not fail if one of the entries has text
|
598 |
that cannot be displayed.
|
|
599 |
|
|
600 |
strict
|
|
2063.3.1
by wang
fix typos |
601 |
Attempting to print an unprintable character will cause a UnicodeError.
|
1711.2.95
by John Arbash Meinel
Add HACKING note for the self.outf parameter. |
602 |
This is for commands that are intended more as scripting support, rather
|
603 |
than plain user review.
|
|
604 |
For exampl: ``bzr ls`` is designed to be used with shell scripting. One
|
|
605 |
use would be ``bzr ls --null --unknows | xargs -0 rm``. If ``bzr``
|
|
606 |
printed a filename with a '?', the wrong file could be deleted. (At the
|
|
607 |
very least, the correct file would not be deleted). An error is used to
|
|
608 |
indicate that the requested action could not be performed.
|
|
609 |
|
|
610 |
exact
|
|
611 |
Do not attempt to automatically convert Unicode strings. This is used
|
|
612 |
for commands that must handle conversion themselves.
|
|
613 |
For example: ``bzr diff`` needs to translate Unicode paths, but should
|
|
614 |
not change the exact text of the contents of the files.
|
|
615 |
||
616 |
||
617 |
``bzrlib.urlutils.unescape_for_display``
|
|
618 |
----------------------------------------
|
|
619 |
||
620 |
Because Transports work in URLs (as defined earlier), printing the raw URL
|
|
621 |
to the user is usually less than optimal. Characters outside the standard
|
|
622 |
set are printed as escapes, rather than the real character, and local
|
|
623 |
paths would be printed as ``file://`` urls. The function
|
|
624 |
``unescape_for_display`` attempts to unescape a URL, such that anything
|
|
625 |
that cannot be printed in the current encoding stays an escaped URL, but
|
|
626 |
valid characters are generated where possible.
|
|
627 |
||
628 |
||
2405.2.2
by Andrew Bennetts
Add a brief section on portability to HACKING. |
629 |
Portability Tips
|
630 |
================
|
|
631 |
||
632 |
The ``bzrlib.osutils`` module has many useful helper functions, including
|
|
633 |
some more portable variants of functions in the standard library.
|
|
634 |
||
635 |
In particular, don't use ``shutil.rmtree`` unless it's acceptable for it
|
|
636 |
to fail on Windows if some files are readonly or still open elsewhere.
|
|
637 |
Use ``bzrlib.osutils.rmtree`` instead.
|
|
638 |
||
639 |
||
1185.33.98
by Martin Pool
Add notes on merge/review process. |
640 |
Merge/review process
|
641 |
====================
|
|
642 |
||
643 |
If you'd like to propose a change, please post to the
|
|
2234.5.1
by Wouter van Heyst
Update the mailing list address. |
644 |
bazaar@lists.canonical.com list with a patch, bzr changeset, or link to a
|
1185.33.98
by Martin Pool
Add notes on merge/review process. |
645 |
branch. Please put '[patch]' in the subject so we can pick them out, and
|
646 |
include some text explaining the change. Remember to put an update to the NEWS
|
|
647 |
file in your diff, if it makes any changes visible to users or plugin
|
|
648 |
developers. Please include a diff against mainline if you're giving a link to
|
|
649 |
a branch.
|
|
650 |
||
651 |
Please indicate if you think the code is ready to merge, or if it's just a
|
|
652 |
draft or for discussion. If you want comments from many developers rather than
|
|
653 |
to be merged, you can put '[rfc]' in the subject lines.
|
|
654 |
||
655 |
Anyone is welcome to review code. There are broadly three gates for
|
|
656 |
code to get in:
|
|
657 |
||
658 |
* Doesn't reduce test coverage: if it adds new methods or commands,
|
|
659 |
there should be tests for them. There is a good test framework
|
|
660 |
and plenty of examples to crib from, but if you are having trouble
|
|
661 |
working out how to test something feel free to post a draft patch
|
|
662 |
and ask for help.
|
|
663 |
||
664 |
* Doesn't reduce design clarity, such as by entangling objects
|
|
665 |
we're trying to separate. This is mostly something the more
|
|
666 |
experienced reviewers need to help check.
|
|
667 |
||
668 |
* Improves bugs, features, speed, or code simplicity.
|
|
669 |
||
670 |
Code that goes in should pass all three.
|
|
671 |
||
672 |
If you read a patch please reply and say so. We can use a numeric scale
|
|
673 |
of -1, -0, +0, +1, meaning respectively "really don't want it in current
|
|
674 |
form", "somewhat uncomfortable", "ok with me", and "please put it in".
|
|
675 |
Anyone can "vote". (It's not really voting, just a terse expression.)
|
|
676 |
||
677 |
If something gets say two +1 votes from core reviewers, and no
|
|
678 |
vetos, then it's OK to come in. Any of the core developers can bring it
|
|
679 |
into their integration branch, which I'll merge regularly. (If you do
|
|
680 |
so, please reply and say so.)
|
|
681 |
||
682 |
||
1739.1.2
by Robert Collins
More pyrex finesse, documentation. |
683 |
C Extension Modules
|
684 |
===================
|
|
685 |
||
686 |
We write some extensions in C using pyrex. We design these to work in
|
|
687 |
three scenarios:
|
|
2449.1.1
by Alexander Belchenko
fix RSTX wrong formatting in HACKING |
688 |
|
1739.1.2
by Robert Collins
More pyrex finesse, documentation. |
689 |
* User with no C compiler
|
690 |
* User with C compiler
|
|
691 |
* Developers
|
|
692 |
||
693 |
The recommended way to install bzr is to have a C compiler so that the
|
|
694 |
extensions can be built, but if no C compiler is present, the pure python
|
|
695 |
versions we supply will work, though more slowly.
|
|
696 |
||
697 |
For developers we recommend that pyrex be installed, so that the C
|
|
698 |
extensions can be changed if needed.
|
|
699 |
||
700 |
For the C extensions, the extension module should always match the
|
|
701 |
original python one in all respects (modulo speed). This should be
|
|
702 |
maintained over time.
|
|
703 |
||
704 |
To create an extension, add rules to setup.py for building it with pyrex,
|
|
705 |
and with distutils. Now start with an empty .pyx file. At the top add
|
|
706 |
"include 'yourmodule.py'". This will import the contents of foo.py into this
|
|
707 |
file at build time - remember that only one module will be loaded at
|
|
708 |
runtime. Now you can subclass classes, or replace functions, and only your
|
|
709 |
changes need to be present in the .pyx file.
|
|
710 |
||
711 |
Note that pyrex does not support all 2.4 programming idioms, so some
|
|
712 |
syntax changes may be required. I.e.
|
|
2449.1.1
by Alexander Belchenko
fix RSTX wrong formatting in HACKING |
713 |
|
1739.1.2
by Robert Collins
More pyrex finesse, documentation. |
714 |
- 'from foo import (bar, gam)' needs to change to not use the brackets.
|
715 |
- 'import foo.bar as bar' needs to be 'import foo.bar; bar = foo.bar'
|
|
2449.1.1
by Alexander Belchenko
fix RSTX wrong formatting in HACKING |
716 |
|
1739.1.2
by Robert Collins
More pyrex finesse, documentation. |
717 |
If the changes are too dramatic, consider
|
718 |
maintaining the python code twice - once in the .pyx, and once in the .py,
|
|
719 |
and no longer including the .py file.
|
|
720 |
||
1861.2.19
by Alexander Belchenko
HACKING: mention where to get instructions for building windows installers |
721 |
Making installers for OS Windows
|
722 |
================================
|
|
1861.2.20
by Alexander Belchenko
English |
723 |
To build a win32 installer, see the instructions on the wiki page:
|
1861.2.19
by Alexander Belchenko
HACKING: mention where to get instructions for building windows installers |
724 |
http://bazaar-vcs.org/BzrWin32Installer
|
725 |
||
726 |
||
1740.6.1
by Martin Pool
Remove Scratch objects used by doctests |
727 |
:: vim: ft=rst tw=74 ai
|