~bzr-pqm/bzr/bzr.dev

« back to all changes in this revision

Viewing changes to doc/developers/inventory.txt

Committer: Andrew Bennetts
Date: 2009-12-03 05:57:41 UTC
mfrom: (4857 +trunk)
mto: This revision was merged to the branch mainline in revision 4869.
Revision ID: andrew.bennetts@canonical.com-20091203055741-vmmg0fmjgjw2pwvu

Merge lp:bzr.

files added:
bzrlib/tests/per_foreign_vcs/test_repository.py

files removed:
bzrlib/textui.py

files modified:
NEWS

bzrlib/_btree_serializer_py.py

bzrlib/_known_graph_py.py

bzrlib/_known_graph_pyx.pyx

bzrlib/_static_tuple_py.py

bzrlib/branch.py

bzrlib/btree_index.py

bzrlib/builtins.py

bzrlib/bundle/__init__.py

bzrlib/bzrdir.py

bzrlib/commands.py

bzrlib/config.py

bzrlib/conflicts.py

bzrlib/export/zip_exporter.py

bzrlib/fetch.py

bzrlib/foreign.py

bzrlib/graph.py

bzrlib/groupcompress.py

bzrlib/help_topics/__init__.py

bzrlib/help_topics/en/conflicts.txt

bzrlib/index.py

bzrlib/knit.py

bzrlib/lockdir.py

bzrlib/log.py

bzrlib/merge.py

bzrlib/merge_directive.py

bzrlib/osutils.py

bzrlib/push.py

bzrlib/repository.py

bzrlib/revision.py

bzrlib/shelf_ui.py

bzrlib/static_tuple.py

bzrlib/tests/__init__.py

bzrlib/tests/blackbox/test_export.py

bzrlib/tests/blackbox/test_ls.py

bzrlib/tests/blackbox/test_merge.py

bzrlib/tests/blackbox/test_push.py

bzrlib/tests/blackbox/test_send.py

bzrlib/tests/blackbox/test_serve.py

bzrlib/tests/http_server.py

bzrlib/tests/per_bzrdir/test_bzrdir.py

bzrlib/tests/per_foreign_vcs/__init__.py

bzrlib/tests/per_intertree/__init__.py

bzrlib/tests/per_workingtree/test_content_filters.py

bzrlib/tests/ssl_certs/create_ssls.py

bzrlib/tests/ssl_certs/server.crt

bzrlib/tests/ssl_certs/server.csr

bzrlib/tests/ssl_certs/server_with_pass.key

bzrlib/tests/ssl_certs/server_without_pass.key

bzrlib/tests/test__known_graph.py

bzrlib/tests/test__static_tuple.py

bzrlib/tests/test_btree_index.py

bzrlib/tests/test_graph.py

bzrlib/tests/test_index.py

bzrlib/tests/test_osutils.py

bzrlib/tests/test_urlutils.py

bzrlib/trace.py

bzrlib/transform.py

bzrlib/tree.py

bzrlib/urlutils.py

bzrlib/util/_bencode_py.py

bzrlib/version.py

bzrlib/workingtree.py

bzrlib/workingtree_4.py

doc/default.css

doc/developers/HACKING.txt

doc/developers/add.txt

doc/developers/api-versioning.txt

doc/developers/apport.txt

doc/developers/authentication-ring.txt

doc/developers/bug-handling.txt

doc/developers/bundles.txt

doc/developers/case-insensitive-file-systems.txt

doc/developers/colocated-branches.txt

doc/developers/commit.txt

doc/developers/container-format.txt

doc/developers/content-filtering.txt

doc/developers/cycle.txt

doc/developers/development-repo.txt

doc/developers/diff.txt

doc/developers/directory-fingerprints.txt

doc/developers/ec2.txt

doc/developers/improved_chk_index.txt

doc/developers/incremental-push-pull.txt

doc/developers/index-plain.txt

doc/developers/index.txt

doc/developers/inventory.txt

doc/developers/last-modified.txt

doc/developers/network-protocol.txt

doc/developers/overview.txt

doc/developers/performance-use-case-analysis.txt

doc/developers/planned-change-integration.txt

doc/developers/planned-performance-changes.txt

doc/developers/plans.txt

doc/developers/plugin-api.txt

doc/developers/ppa.txt

doc/developers/process.txt

doc/developers/profiling.txt

doc/developers/releasing.txt

doc/developers/repository-stream.txt

doc/developers/repository.txt

doc/developers/revert.txt

doc/developers/specifications.txt

doc/developers/status.txt

doc/developers/testing.txt

doc/developers/tortoise-strategy.txt

doc/developers/update.txt

doc/developers/win32_build_setup.txt

doc/en/mini-tutorial/index.txt

doc/en/tutorials/centralized_workflow.txt

doc/en/tutorials/tutorial.txt

doc/en/tutorials/using_bazaar_with_launchpad.txt

doc/en/user-guide/adv_merging.txt

doc/en/user-guide/branching_a_project.txt

doc/en/user-guide/configuring_bazaar.txt

doc/en/user-guide/controlling_registration.txt

doc/en/user-guide/distributed_intro.txt

doc/en/user-guide/http_smart_server.txt

doc/en/user-guide/index-plain.txt

doc/en/user-guide/index.txt

doc/en/user-guide/introducing_bazaar.txt

doc/en/user-guide/plugins.txt

doc/en/user-guide/publishing_a_branch.txt

doc/en/user-guide/recording_changes.txt

doc/en/user-guide/resolving_conflicts.txt

doc/en/user-guide/reviewing_changes.txt

doc/en/user-guide/sending_changes.txt

doc/en/user-guide/server.txt

doc/en/user-guide/setting_up_email.txt

doc/en/user-guide/shared_repository_layouts.txt

doc/en/user-guide/shelving_changes.txt

doc/en/user-guide/specifying_revisions.txt

doc/en/user-guide/stacked.txt

doc/en/user-guide/version_info.txt

doc/en/user-guide/web_browsing.txt

doc/en/user-guide/zen.txt

doc/es/index.txt

doc/es/mini-tutorial/index.txt

doc/es/user-guide/index-plain.txt

doc/es/user-guide/index.txt

doc/es/user-guide/resolving_conflicts.txt

doc/es/user-guide/version_info.txt

doc/index.es.txt

doc/index.ru.txt

doc/ja/tutorials/using_bazaar_with_launchpad.txt

doc/ja/upgrade-guide/data_migration.txt

doc/ja/user-guide/entering_commands.txt

doc/ja/user-guide/http_smart_server.txt

doc/ja/user-guide/introducing_bazaar.txt

doc/ja/user-guide/setting_up_email.txt

doc/ja/user-guide/version_info.txt

doc/ja/user-reference/index.txt

doc/ru/index.txt

doc/ru/mini-tutorial/index.txt

doc/ru/tutorials/centralized_workflow.txt

doc/ru/tutorials/tutorial.txt

doc/ru/tutorials/using_bazaar_with_launchpad.txt

doc/ru/user-guide/branching_a_project.txt

doc/ru/user-guide/index-plain.txt

doc/ru/user-guide/index.txt

doc/ru/user-guide/introducing_bazaar.txt

doc/ru/user-guide/specifying_revisions.txt

doc/ru/user-guide/zen.txt

Show diffs side-by-side

added added

removed removed

doc/developers/inventory.txt

that an inventory is stored as. We have a number of goals we want to achieve:

1. Allow commit to write less than the full tree's data in to the repository

in the general case.

2. Allow the data that is written to be calculated without examining every

versioned path in the tree.

3. Generate the exact same representation for a given inventory regardless of

-----------------

The xml based implementation we use today layers the inventory as a bytestring

which is stored under a single key; the bytestring is then compressed as a

delta against the bytestring of its left hand parent by the knit code.

Gap analysis:

140

-------------------------------------------------------------------------

141

142

* Split up the logical document into smaller serialised fragements. For

143

instance hash buckets or nodes in a tree of some sort. By serialising in

144

smaller units, we can increase the number of smaller units rather than

143

instance hash buckets or nodes in a tree of some sort. By serialising in

144

smaller units, we can increase the number of smaller units rather than

145

their size as the tree grows; as long as two similar trees have similar

146

serialised forms, the amount of different content should be quite high.

147

166

167

* Working tree to arbitrary history revision deltas/comparisons can be scaled

168

up by doing a two-step (fixed at two!) delta combining - delta(tree, basis)

169

and then combine that with delta(basis, arbitrary_revision) using the

169

and then combine that with delta(basis, arbitrary_revision) using the

170

repositories ability to get a delta cheaply.

171

172

* The key primitives we need seem to be:

263

264

The path and content maps are populated simply by serialising every inventory

265

entry and inserting them into both the path map and the content map. The maps

266

start with just a single leaf node with an empty prefix.

266

start with just a single leaf node with an empty prefix.

267

268

269

Apply

439

formerly linked. (This will normally bubble down due to keeping densely

440

packed nodes).

441

To shrink the prefix of a leaf node, create an internal node with the same

442

prefix, then choose a width for the internal node such that the contents

442

prefix, then choose a width for the internal node such that the contents

443

of the leaf all fit into new leaves obeying the min_size and max_size rules.

444

The largest prefix possible should be chosen, to obey the

445

higher-nodes-are-denser rule. That rule also gives room in leaf nodes for

445

higher-nodes-are-denser rule. That rule also gives room in leaf nodes for

446

growth without affecting the parent node packing.

447

#. Update the CHK pointers - serialise every altered node to generate a CHK,

448

and update the CHK placeholder in the nodes parent; then reserialise the

590

591

Different trees can use different algorithms to expand the request as long as

592

they produce consistent deltas. As part of getting a consistent UI we require

593

that all trees expand the paths requested downwards. Beyond that as long as

593

that all trees expand the paths requested downwards. Beyond that as long as

594

the delta is consistent it is up to the tree.

595

596

Given two trees, source and target, and a set of selected file ids to check for

598

the following rules, to get consistent deltas. The test for consistency is that

599

if the resulting delta is applied to source, to create a third tree 'output',

600

and the paths in the delta match the paths in source and output, only one file

601

id is at each path in output, and no file ids are missing parents, then the

601

id is at each path in output, and no file ids are missing parents, then the

602

delta is consistent.

603

604

Firstly, the parent ids to the root for all of the file ids that have actually

Older »