~bzr-pqm/bzr/bzr.dev

« back to all changes in this revision

Viewing changes to doc/developers/fetch.txt

Committer: Jelmer Vernooij
Date: 2011-01-19 22:36:40 UTC
mto: This revision was merged to the branch mainline in revision 5626.
Revision ID: jelmer@samba.org-20110119223640-mugs4t1nbl55tf7c

fix import.

files added:
contrib/add-bzr-to-baz

contrib/newinventory.py

contrib/pwclient.full

contrib/pwk

tools/convertfile.py

tools/convertinv.py

tools/trace-revisions

tools/weavebench.py

files removed:
bzrlib/mergetools.py

bzrlib/tests/blackbox/test_repair_workingtree.py

bzrlib/tests/per_repository_reference/test_graph.py

bzrlib/tests/per_workingtree/test_check_state.py

bzrlib/tests/test_mergetools.py

bzrlib/tests/test_thread.py

bzrlib/thread.py

doc/developers/fetch.txt

doc/en/release-notes/bzr-2.4.txt

doc/en/whats-new/whats-new-in-2.4.txt

files modified:
bzr

bzrlib/__init__.py

bzrlib/branch.py

bzrlib/builtins.py

bzrlib/bundle/__init__.py

bzrlib/bzrdir.py

bzrlib/cmdline.py

bzrlib/commands.py

bzrlib/commit.py

bzrlib/config.py

bzrlib/conflicts.py

bzrlib/controldir.py

bzrlib/crash.py

bzrlib/dirstate.py

bzrlib/errors.py

bzrlib/fetch.py

bzrlib/graph.py

bzrlib/groupcompress.py

bzrlib/help_topics/en/configuration.txt

bzrlib/knit.py

bzrlib/merge.py

bzrlib/msgeditor.py

bzrlib/option.py

bzrlib/osutils.py

bzrlib/plugin.py

bzrlib/plugins/bash_completion/tests/test_bashcomp.py

bzrlib/plugins/launchpad/lp_api.py

bzrlib/plugins/launchpad/lp_directory.py

bzrlib/plugins/launchpad/lp_propose.py

bzrlib/plugins/launchpad/lp_registration.py

bzrlib/plugins/launchpad/test_lp_api.py

bzrlib/plugins/launchpad/test_lp_directory.py

bzrlib/remote.py

bzrlib/repofmt/knitrepo.py

bzrlib/repofmt/pack_repo.py

bzrlib/repofmt/weaverepo.py

bzrlib/repository.py

bzrlib/smart/repository.py

bzrlib/smart/server.py

bzrlib/status.py

bzrlib/tests/__init__.py

bzrlib/tests/blackbox/__init__.py

bzrlib/tests/blackbox/test_annotate.py

bzrlib/tests/blackbox/test_branch.py

bzrlib/tests/blackbox/test_cat_revision.py

bzrlib/tests/blackbox/test_debug.py

bzrlib/tests/blackbox/test_diff.py

bzrlib/tests/blackbox/test_dump_btree.py

bzrlib/tests/blackbox/test_locale.py

bzrlib/tests/blackbox/test_merge.py

bzrlib/tests/blackbox/test_mv.py

bzrlib/tests/blackbox/test_pull.py

bzrlib/tests/blackbox/test_serve.py

bzrlib/tests/blackbox/test_status.py

bzrlib/tests/blackbox/test_tags.py

bzrlib/tests/blackbox/test_upgrade.py

bzrlib/tests/blackbox/test_whoami.py

bzrlib/tests/features.py

bzrlib/tests/per_branch/__init__.py

bzrlib/tests/per_branch/test_bound_sftp.py

bzrlib/tests/per_branch/test_branch.py

bzrlib/tests/per_branch/test_commit.py

bzrlib/tests/per_branch/test_last_revision_info.py

bzrlib/tests/per_branch/test_pull.py

bzrlib/tests/per_branch/test_push.py

bzrlib/tests/per_branch/test_sprout.py

bzrlib/tests/per_controldir/__init__.py

bzrlib/tests/per_controldir/test_controldir.py

bzrlib/tests/per_controldir_colo/test_supported.py

bzrlib/tests/per_controldir_colo/test_unsupported.py

bzrlib/tests/per_interrepository/test_interrepository.py

bzrlib/tests/per_pack_repository.py

bzrlib/tests/per_repository/test_reconcile.py

bzrlib/tests/per_repository/test_repository.py

bzrlib/tests/per_repository_reference/__init__.py

bzrlib/tests/per_repository_reference/test_fetch.py

bzrlib/tests/per_transport.py

bzrlib/tests/per_tree/__init__.py

bzrlib/tests/per_versionedfile.py

bzrlib/tests/per_workingtree/__init__.py

bzrlib/tests/per_workingtree/test_move.py

bzrlib/tests/per_workingtree/test_rename_one.py

bzrlib/tests/per_workingtree/test_workingtree.py

bzrlib/tests/test_branch.py

bzrlib/tests/test_btree_index.py

bzrlib/tests/test_bzrdir.py

bzrlib/tests/test_cmdline.py

bzrlib/tests/test_config.py

bzrlib/tests/test_conflicts.py

bzrlib/tests/test_crash.py

bzrlib/tests/test_dirstate.py

bzrlib/tests/test_http.py

bzrlib/tests/test_index.py

bzrlib/tests/test_inv.py

bzrlib/tests/test_knit.py

bzrlib/tests/test_msgeditor.py

bzrlib/tests/test_options.py

bzrlib/tests/test_osutils.py

bzrlib/tests/test_permissions.py

bzrlib/tests/test_plugins.py

bzrlib/tests/test_remote.py

bzrlib/tests/test_repository.py

bzrlib/tests/test_server.py

bzrlib/tests/test_smart.py

bzrlib/tests/test_test_server.py

bzrlib/tests/test_workingtree.py

bzrlib/tests/transport_util.py

bzrlib/transport/__init__.py

bzrlib/transport/http/__init__.py

bzrlib/transport/http/_urllib2_wrappers.py

bzrlib/transport/pathfilter.py

bzrlib/versionedfile.py

bzrlib/workingtree.py

bzrlib/workingtree_4.py

doc/developers/index.txt

doc/developers/integration.txt

doc/developers/overview.txt

doc/developers/ppa.txt

doc/developers/releasing.txt

doc/developers/testing.txt

doc/en/_templates/index.html

doc/en/index.txt

doc/en/release-notes/bzr-2.2.txt

doc/en/release-notes/bzr-2.3.txt

doc/en/user-guide/configuring_bazaar.txt

doc/en/user-guide/specifying_revisions.txt

doc/en/whats-new/whats-new-in-2.2.txt

doc/en/whats-new/whats-new-in-2.3.txt

Show diffs side-by-side

added added

removed removed

doc/developers/fetch.txt

=============

Fetching data

=============

Overview of a fetch

===================

Inside bzr, a typical fetch happens like this:

* a user runs a command like ``bzr branch`` or ``bzr pull``

* ``Repository.fetch`` is called (by a higher-level method such as

``ControlDir.sprout``, ``Branch.fetch``, etc).

* An ``InterRepository`` object is created. The exact implementation of

``InterRepository`` chosen depends on the format/capabilities of the

source and target repos.

* The source and target repositories are compared to determine which data

needs to be transferred.

* The repository data is copied. Often this is done by creating a

``StreamSource`` and ``StreamSink`` from the source and target

repositories and feeding the stream from the source into the sink, but

some ``InterRepository`` implementations do differently.

How objects to be transferred are determined

============================================

See ``InterRepository._walk_to_common_revisions``. The basic idea is to

do a breadth-first search in the source repository's revision graph

(starting from the head or heads the caller asked for), and look in the

target repository to see if those revisions are already present.

Eventually this will find the common ancestors in both graphs, and thus

the set of revisions to be copied has been identified.

All inventories for the copied revisions need to be present (and all

parent inventories at the stacking boundary too, to support stacking).

All texts versions introduced by those inventories need to be transferred

(but see also stacking constraints).

Fetch specs

===========

The most ``fetch`` methods accept a ``fetch_spec`` parameter. This is how

the caller controls what is fetched: e.g. all revisions for a given head

(that aren't already present in the target), the full ancestry for one or

more heads, or even the full contents of the source repository.

The ``fetch_spec`` parameter is an object that implements the interface

defined by ``AbstractSearchResult`` in ``bzrlib.graph``. It describes

which keys should be fetched. Current implementations are

``SearchResult``, ``PendingAncestryResult``, ``EmptySearchResult``, and

``EverythingResult``. Some have options controlling if missing revisions

cause errors or not, etc.

There are also some “search” objects, which can be used to conveniently

construct a search result for common cases: ``EverythingNotInOther`` and

``NotInOtherForRevs``. They provide an ``execute`` method that performs

the search and returns a search result.

Also, ``Graph._make_breadth_first_searcher`` returns an object with a

``get_result`` method that returns a search result.

Streams

=======

A **stream** is an iterable of (substream type, substream) pairs.

The **substream type** is a ``str`` that will be one of ``texts``,

``inventories``, ``inventory-deltas``, ``chk_bytes``, ``revisions`` or

``signatures``. A **substream** is a record stream. The format of those

records depends on the repository format being streamed, except for

``inventory-deltas`` records which are format-independent.

A stream source can be constructed with ``repo._get_source(to_format)``,

and it provides a ``get_stream(search)`` method (among others). A stream

sink can be constructed with ``repo._get_sink()``, and provides an

``insert_stream(stream, src_format, resume_tokens)`` method (among

others).

vim: ft=rst tw=74 ai

Older »