~bzr-pqm/bzr/bzr.dev

« back to all changes in this revision

Viewing changes to doc/developers/incremental-push-pull.txt

Committer: Martin Pool
Date: 2007-06-15 07:01:24 UTC
mfrom: (2528 +trunk)
mto: This revision was merged to the branch mainline in revision 2530.
Revision ID: mbp@sourcefrog.net-20070615070124-clpwqh5gxc4wbf9l

Merge trunk

files added:
bzrlib/branchbuilder.py

bzrlib/counted_lock.py

bzrlib/tests/branch_implementations/test_sprout.py

bzrlib/tests/test_branchbuilder.py

bzrlib/tests/test_counted_lock.py

bzrlib/tests/test_info.py

bzrlib/tests/test_lsprof.py

doc/developers/add.txt

doc/developers/annotate.txt

doc/developers/bundle-creation.txt

doc/developers/container-format.txt

doc/developers/gc.txt

doc/developers/initial-push-pull.txt

doc/developers/merge-scaling.txt

doc/developers/performance-commit.txt

doc/developers/performance.dot

doc/developers/planned-performance-changes.txt

doc/developers/profiling.txt

doc/developers/revert.txt

files modified:
Makefile

NEWS

README

bzrlib/__init__.py

bzrlib/branch.py

bzrlib/builtins.py

bzrlib/commands.py

bzrlib/commit.py

bzrlib/dirstate.py

bzrlib/errors.py

bzrlib/help_topics.py

bzrlib/info.py

bzrlib/log.py

bzrlib/lsprof.py

bzrlib/merge.py

bzrlib/missing.py

bzrlib/osutils.py

bzrlib/symbol_versioning.py

bzrlib/tests/HTTPTestUtil.py

bzrlib/tests/__init__.py

bzrlib/tests/blackbox/test_help.py

bzrlib/tests/blackbox/test_info.py

bzrlib/tests/blackbox/test_init.py

bzrlib/tests/blackbox/test_log.py

bzrlib/tests/blackbox/test_revision_info.py

bzrlib/tests/branch_implementations/__init__.py

bzrlib/tests/branch_implementations/test_branch.py

bzrlib/tests/test_ancestry.py

bzrlib/tests/test_commit.py

bzrlib/tests/test_dirstate.py

bzrlib/tests/test_http.py

bzrlib/tests/test_lockdir.py

bzrlib/tests/test_log.py

bzrlib/tests/test_merge.py

bzrlib/tests/test_missing.py

bzrlib/tests/test_repository.py

bzrlib/tests/test_revert.py

bzrlib/tests/test_selftest.py

bzrlib/tests/test_transform.py

bzrlib/tests/test_treebuilder.py

bzrlib/tests/test_urlutils.py

bzrlib/tests/workingtree_implementations/test_remove.py

bzrlib/tests/workingtree_implementations/test_workingtree.py

bzrlib/transform.py

bzrlib/transport/http/__init__.py

bzrlib/transport/http/_pycurl.py

bzrlib/ui/__init__.py

bzrlib/urlutils.py

bzrlib/version.py

bzrlib/workingtree.py

bzrlib/workingtree_4.py

contrib/bash/bzr.simple

doc/developers/HACKING

doc/developers/incremental-push-pull.txt

doc/developers/index.txt

doc/developers/performance-roadmap-rationale.txt

doc/developers/performance-roadmap.txt

doc/developers/performance-use-case-analysis.txt

doc/tutorial.txt

setup.py

Show diffs side-by-side

added added

removed removed

doc/developers/incremental-push-pull.txt

Incremental push/pull

---------------------

=====================

This use case covers pulling in or pushing out some number of revisions which

is typically a small fraction of the number already present in the target

responsibility of the Repository object.

Functional Requirements

=======================

-----------------------

A push or pull operation must:

* Copy all the data to reconstruct the selected revisions in the target

data, corrupted data should not be incorporated accidentally.

Factors which should add work for push/pull

===========================================

-------------------------------------------

* Baseline overhead: The time to connect to both branches.

* Actual new data in the revisions being pulled (drives the amount of data to

determination of what revisions to move around).

Push/pull overview

==================

------------------

1. New data is identified in the source repository.

2. That data is read from the source repository.

manner that its not visible to readers until its ready for use.

New data identification

+++++++++++++++++++++++

~~~~~~~~~~~~~~~~~~~~~~~

We have a single top level data object: revisions. Everything else is

subordinate to revisions, so determining the revisions to propogate should be

124

125

126

Data reading

127

++++++++++++

127

~~~~~~~~~~~~

128

129

When transferring information about a revision the graph of data for the

130

revision is walked: revision -> inventory, revision -> matching signature,

156

157

158

Data Verification and writing

159

+++++++++++++++++++++++++++++

159

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

160

161

New data written to a repository should be completed intact when it is made

162

visible. This suggests that either all the data for a revision must be made

240

transmission method to reasonably closely match the desired write ordering

241

locally. This suggests that once we decide on the best local storage means we

242

should design the api.

243

244

245

take N commits from A to B, if B is local then merge changes into the tree.

246

copy ebough data to recreate snapshots

247

avoid ending up wth corrupt/bad data

248

249

Notes from London

250

-----------------

251

252

#. setup

253

254

look at graph of revisions for ~N comits to deretmine eligibility for

255

if preserve mainline is on, check LH only

256

257

identify objects to send that are not on the client repo

258

- revision - may be proportional to the graph

259

- inventory - proportional to work

260

- texts - proportional to work

261

- signatures - ???

262

263

#. data transmission

264

265

* send data proportional to the new information

266

* validate the data:

267

268

#. validate the sha1 of the full text of each transmitted text.

269

#. validate the sha1:name mapping in each newly referenced inventory item.

270

#. validate the sha1 of the XML of each inventory against the revision.

271

**this is proportional to tree size and must be fixed**

272

273

#. write the data to the local repo.

274

The API should output the file texts needed by the merge as by product of the transmission

275

276

#. tree application

277

278

Combine the output from the transmission step with additional 'new work data' for anything already in the local repository that is new in this tree.

279

should write new files and stat existing files proportional to the count of the new work and the size of the full texts.

Older »