~bzr-pqm/bzr/bzr.dev

« back to all changes in this revision

Viewing changes to TODO

Committer: Martin Pool
Date: 2005-05-11 08:04:05 UTC
Revision ID: mbp@sourcefrog.net-20050511080405-c795b657aa629008

- more status form test fixes

files added:
.bzrignore

.rsyncexclude

NEWS

TODO

bzrlib/add.py

bzrlib/atomicfile.py

bzrlib/help.py

bzrlib/info.py

bzrlib/log.py

bzrlib/mdiff.py

bzrlib/newinventory.py

bzrlib/remotebranch.py

bzrlib/revfile.py

bzrlib/statcache.py

bzrlib/status.py

bzrlib/textinv.py

bzrlib/workingtree.py

contrib

contrib/add-bzr-to-baz

contrib/bash

contrib/bash/bzr

contrib/fortune

contrib/zsh

contrib/zsh/_bzr

doc/ignore.txt

doc/quotes.txt

doc/revfile-annotation.txt

doc/revfile.txt

doc/switch-in-branch.txt

elementtree

elementtree/ElementTree.py

elementtree/__init__.py

notes/new-inventory-sample.xml

setup.py

testbzr

urlgrabber

urlgrabber/__init__.py

urlgrabber/byterange.py

urlgrabber/grabber.py

urlgrabber/keepalive.py

urlgrabber/mirror.py

urlgrabber/progress.py

files removed:
doc/faq.txt

doc/quickref.txt

doc/roadmap.txt

doc/testing.txt

doc/work-order.txt

files renamed:
bzr.py => bzrlib/commands.py

files modified:
README

bzrlib/__init__.py

bzrlib/branch.py

bzrlib/check.py

bzrlib/diff.py

bzrlib/errors.py

bzrlib/inventory.py

bzrlib/osutils.py

bzrlib/revision.py

bzrlib/store.py

bzrlib/tests.py

bzrlib/textui.py

bzrlib/trace.py

bzrlib/tree.py

bzrlib/xml.py

doc/Makefile

doc/bitkeeper.txt

doc/compared-codeville.txt

doc/darcs.txt

doc/formats.txt

doc/index.txt

doc/interrupted.txt

doc/merge.txt

doc/purpose.txt

doc/python.txt

doc/random.txt

doc/svk.txt

doc/thanks.txt

doc/todo-from-arch.txt

notes/performance.txt

Show diffs side-by-side

added added

removed removed

TODO

.. -*- mode: indented-text; compile-command: "make -C doc" -*-

*******************

Things to do in bzr

*******************

See also various low-level TODOs in the source code. Try looking in

the list archive or on gmane.org for previous discussion of these

issues.

These are classified by approximate size: an hour or less, a day or

less, and several days or more.

Small things

------------

* Add of a file that was present in the base revision should put back

the previous file-id.

* Handle diff of files which do not have a trailing newline; probably

requires patching difflib to get it exactly right, or otherwise

calling out to GNU diff.

* Import ElementTree update patch.

* Plugins that provide commands. By just installing a file into some

directory (e.g. ``/usr/share/bzr/plugins``) it should be possible to

create new top-level commands (``bzr frob``). Extensions can be

written in either Python (in which case they use the bzrlib API) or

in a separate process (in sh, C, whatever). It should be possible

to get help for plugin commands.

* Smart rewrap text in help messages to fit in $COLUMNS (or equivalent

on Windows)

* -r option should take a revision-id as well as a revno.

* ``bzr info`` could show space used by working tree, versioned files,

unknown and ignored files.

* ``bzr info`` should count only people with distinct email addresses as

different committers. (Or perhaps only distinct userids?)

* On Windows, command-line arguments should be `glob-expanded`__,

because the shell doesn't do this. However, there are probably some

commands where this shouldn't be done, such as 'bzr ignore', because

we want to accept globs.

* ``bzr ignore`` command that just adds a line to the ``.bzrignore`` file

and makes it versioned. Fix this to break symlinks.

* Any useful sanity checks in 'bzr ignore'? Perhaps give a warning if

they try to add a single file which is already versioned, or if they

add a pattern which already exists, or if it looks like they gave an

unquoted glob.

__ http://mail.python.org/pipermail/python-list/2001-April/037847.html

* Separate read and write version checks?

* ``bzr status DIR`` should give status on all files under that

directory.

* Check all commands have decent help.

* ``bzr inventory -r REV`` and perhaps unify this with ``bzr ls``,

giving options to display ids, types, etc.

* Atomic file class that renames into place when it's closed.

* Don't abort if ``~/.bzr.log`` can't be used.

* Split BzrError into various more specific subclasses for different

errors people might want to catch.

* If the export destination ends in '.tar', '.tar.gz', etc then create

a tarball instead of a directory. (Need to actually make a

temporary directory and then tar that up.)

http://www.gelato.unsw.edu.au/archives/git/0504/2194.html

* testbzr should by default test the bzr binary in the same directory

as the testbzr script, or take a path to it as a first parameter.

Should show the version from bzr and the path name.

* RemoteBranch could maintain a cache either in memory or on disk. We

know more than an external cache might about which files are

immutable and which can vary. On the other hand, it's much simpler

to just use an external proxy cache.

Medium things

-------------

* Change command functions into Command() objects, like in hct, and

100

then the grammar can be described directly in there. Since all

101

option definitions are global we can define them just once and

102

reference them from each command.

103

104

* Selective commit of only some files.

105

106

* Merge Aaron's merge code.

107

108

* Merge revert patch.

109

110

* ``bzr mv`` that does either rename or move as in Unix.

111

112

* More efficient diff of only selected files. We should be able to

113

just get the id for the selected files, look up their location and

114

diff just those files. No need to traverse the entire inventories.

115

116

* ``bzr status DIR`` or ``bzr diff DIR`` should report on all changes

117

under that directory.

118

119

* Fix up Inventory objects to represent root object as an entry.

120

121

* Don't convert entire entry from

122

123

* Extract changes from one revision to the next to a text form

124

suitable for transmission over email.

125

126

* More test cases.

127

128

* Write a reproducible benchmark, perhaps importing various kernel versions.

129

130

* Change test.sh from Bourne shell into something in pure Python so

131

that it can be more portable.

132

133

* Directly import diffs! It seems a bit redundant to need to rescan

134

the directory to work out what files diff added/deleted/changed when

135

all the information is there in the diff in the first place.

136

Getting the exact behaviour for added/deleted subdirectories etc

137

might be hard.

138

139

At the very least we could run diffstat over the diff, or perhaps

140

read the status output from patch. Just knowing which files might

141

be modified would be enough to guide the add and commit.

142

143

Given this we might be able to import patches at 1/second or better.

144

145

* Get branch over http.

146

147

* Pull pure updates over http.

148

149

* revfile compression.

150

151

* Split inventory into per-directory files.

152

153

* Fix ignore file parsing:

154

155

- fnmatch is not the same as unix patterns

156

157

- perhaps add extended globs from rsh/rsync

158

159

- perhaps a pattern that matches only directories or non-directories

160

161

* Consider using Python logging library as well as/instead of

162

bzrlib.trace.

163

164

* Commands should give some progress indication by default.

165

166

- But quieten this with ``--silent``.

167

168

* Change to using gettext message localization.

169

170

* Make a clearer separation between internal and external bzrlib

171

interfaces. Make internal interfaces use protected names. Write at

172

least some documentation for those APIs, probably as docstrings.

173

174

Consider using ZopeInterface definitions for the external interface;

175

I think these are already used in PyBaz. They allow automatic

176

checking of the interface but may be unfamiliar to general Python

177

developers, so I'm not really keen.

178

179

* Commands to dump out all command help into a manpage or HTML file or

180

whatever.

181

182

* Handle symlinks in the working directory; at the very least it

183

should be possible for them to be present and ignored/unknown

184

without causing assertion failures.

185

186

Eventually symlinks should be versioned.

187

188

* Allow init in a subdirectory to create a nested repository, but only

189

if the subdirectory is not already versioned. Perhaps also require

190

a ``--nested`` to protect against confusion.

191

192

* Branch names?

193

194

* More test framework:

195

196

- Class that describes the state of a working tree so we can just

197

assert it's equal.

198

199

* There are too many methods on Branch() that really manipulate the

200

WorkingTree. They should be moved across.

201

202

Also there are some methods which are duplicated on Tree and

203

Inventory objects, and it should be made more clear which ones are

204

proxies and which ones behave differently, and how.

205

206

* Try using XSLT to add some formatting to REST-generated HTML. Or

207

maybe write a small Python program that specifies a header and foot

208

for the pages and calls into the docutils libraries.

209

210

* --format=xml for log, status and other commands.

211

212

* Attempting to explicitly add a file that's already added should give

213

a warning; however there should be no warning for directories (since

214

we scan for new children) or files encountered in a directory that's

215

being scanned.

216

217

* Better handling of possible collisions on case-losing filesystems;

218

make sure a single file does not get added twice under different

219

names.

220

221

* Clean up XML inventory:

222

223

- Use nesting rather than parent_id pointers.

224

225

- Hold the ElementTree in memory in the Inventory object and work

226

directly on that, rather than converting into Python objects every

227

time it is read in. Probably still exposoe it through some kind of

228

object interface though, but perhaps that should just be a proxy

229

for the elements.

230

231

- Less special cases for the root directory.

232

233

* Perhaps inventories should remember the revision in which each file

234

was last changed, as well as its current state? This is a bit

235

redundant but might often be interested to know.

236

237

* stat cache should perhaps only stat files as necessary, rather than

238

doing them all up-front. On the other hand, that disallows the

239

opimization of stating them in inode order.

240

241

* It'd be nice to pipeline multiple HTTP requests. Often we can

242

predict what will be wanted in future: all revisions, or all texts

243

in a particular revision, etc.

244

245

urlgrabber's docs say they are working on batched downloads; we

246

could perhaps ride on that or just create a background thread (ew).

247

248

* Should be a signature at the top of the cache file.

249

250

* Paranoid mode where we never trust SHA-1 matches.

251

252

253

Large things

254

------------

255

256

* Generate annotations from current file relative to previous

257

annotations.

258

259

- Is it necessary to store any kind of annotation where data was

260

deleted?

261

262

* Update revfile_ format and make it active:

263

264

- Texts should be identified by something keyed on the revision, not

265

an individual text-id. This is much more useful for annotate I

266

think; we want to map back to the revision that last changed it.

267

268

- Access revfile revisions through the Tree/Store classes.

269

270

- Check them from check commands.

271

272

- Store annotations.

273

274

.. _revfile: revfile.html

275

276

* Hooks for pre-commit, post-commit, etc.

277

278

Consider the security implications; probably should not enable hooks

279

for remotely-fetched branches by default.

280

281

* Pre-commit check. If this hook is defined, it needs to be handled

282

specially: create a temporary directory containing the tree as it

283

will be after the commit. This means excluding any ignored/unknown

284

files, and respecting selective commits. Run the pre-commit check

285

(e.g. compile and run test suite) in there.

286

287

* Web interface

288

289

* GUI (maybe in Python GTK+?)

290

291

* C library interface

292

293

* Expansion of $Id$ keywords within working files. Perhaps do this in

294

exports first as a simpler case because then we don't need to deal

295

with removing the tags on the way back in.

296

297

* ``bzr find``

Older »