==================================
Reading and Writing Config Files
==================================
----------------------------------------
ConfigObj 4 Introduction and Reference
----------------------------------------
:Authors: Michael Foord, Nicola Larosa
:Version: ConfigObj 4.5.2
:Date: 2008/02/24
:Homepage: `ConfigObj Homepage`_
:Sourceforge: Sourceforge_
:Development: `SVN Repository`_
:License: `BSD License`_
:Support: `Mailing List`_
.. _Mailing List: http://lists.sourceforge.net/lists/listinfo/configobj-develop
.. _SVN Repository: http://svn.pythonutils.python-hosting.com
.. meta::
:description: ConfigObj - a Python module for easy reading and writing of
config files.
:keywords: python, script, module, config, configuration, data, persistence,
developer, configparser
.. contents:: ConfigObj Manual
.. sectnum::
Introduction
============
**ConfigObj** is a simple but powerful config file reader and writer: an *ini
file round tripper*. Its main feature is that it is very easy to use, with a
straightforward programmer's interface and a simple syntax for config files.
It has lots of other features though :
* Nested sections (subsections), to any level
* List values
* Multiple line values
* String interpolation (substitution)
* Integrated with a powerful validation system
- including automatic type checking/conversion
- repeated sections
- and allowing default values
* When writing out config files, ConfigObj preserves all comments and the order of members and sections
* Many useful methods and options for working with configuration files (like the 'reload' method)
* Full Unicode support
For support and bug reports please use the ConfigObj `Mailing List`_.
Downloading
===========
The current version is **4.5.2**, dated 24th February 2008. ConfigObj 4 is
now stable. We still expect to pick up a few bugs along the way though [#]_.
{sm;:-)}
You can get ConfigObj in the following ways :
Files
-----
* configobj.py_ from Voidspace
ConfigObj has no external dependencies. This file is sufficient to access
all the functionality except Validation_.
* configobj.zip_ from Voidspace
This also contains validate.py_ and `this document`_.
* The latest development version can be obtained from the `Subversion
Repository`_.
* validate.py_ from Voidspace
* You can also download *configobj.zip* from Sourceforge_
Documentation
-------------
*configobj.zip* also contains `this document`_.
* You can view `this document`_ online at the `ConfigObj Homepage`_.
Pythonutils
-----------
ConfigObj is also part of the Pythonutils_ set of modules. This contains
various other useful modules, and is required by many of the `Voidspace Python
Projects`_.
Development Version
-------------------
It is sometimes possible to get the latest *development version* of ConfigObj
from the `Subversion Repository `_.
.. _configobj.py: http://www.voidspace.org.uk/cgi-bin/voidspace/downman.py?file=configobj.py
.. _configobj.zip: http://www.voidspace.org.uk/cgi-bin/voidspace/downman.py?file=configobj-4.5.2.zip
.. _validate.py: http://www.voidspace.org.uk/cgi-bin/voidspace/downman.py?file=validate.py
.. _this document:
.. _configobj homepage: http://www.voidspace.org.uk/python/configobj.html
.. _Sourceforge: http://sourceforge.net/projects/configobj
.. _pythonutils: http://www.voidspace.org.uk/python/pythonutils.html
.. _Voidspace Python Projects: http://www.voidspace.org.uk/python/index.shtml
ConfigObj in the Real World
===========================
**ConfigObj** is widely used. Projects using it include:
* `Bazaar `_.
Bazaar is a Python distributed {acro;VCS;Version Control System}.
ConfigObj is used to read ``bazaar.conf`` and ``branches.conf``.
* `Turbogears `_
Turbogears is a web application framework.
* `Chandler `_
A Python and `wxPython `_
{acro;PIM;Personal Information Manager}, being developed by the
`OSAFoundation `_.
* `matplotlib `_
A 2D plotting library.
* `IPython `_
IPython is an enhanced interactive Python shell. IPython uses ConfigObj in a module called 'TConfig' that combines it with enthought `Traits `_: `tconfig `_.
* `Elisa - the Fluendo Mediacenter `_
Elisa is an open source cross-platform media center solution designed to be simple for people not particularly familiar with computers.
Getting Started
===============
The outstanding feature of using ConfigObj is simplicity. Most functions can be
performed with single line commands.
Reading a Config File
---------------------
The normal way to read a config file, is to give ConfigObj the filename :
.. raw:: html
{+coloring}
from configobj import ConfigObj
config = ConfigObj(filename)
{-coloring}
You can also pass the config file in as a list of lines, or a ``StringIO``
instance, so it doesn't matter where your config data comes from.
You can then access members of your config file as a dictionary. Subsections
will also be dictionaries.
.. raw:: html
{+coloring}
from configobj import ConfigObj
config = ConfigObj(filename)
#
value1 = config['keyword1']
value2 = config['keyword2']
#
section1 = config['section1']
value3 = section1['keyword3']
value4 = section1['keyword4']
#
# you could also write
value3 = config['section1']['keyword3']
value4 = config['section1']['keyword4']
{-coloring}
Writing a Config File
---------------------
Creating a new config file is just as easy as reading one. You can specify a
filename when you create the ConfigObj, or do it later [#]_.
If you *don't* set a filename, then the ``write`` method will return a list of
lines instead of writing to file. See the write_ method for more details.
Here we show creating an empty ConfigObj, setting a filename and some values,
and then writing to file :
.. raw:: html
{+coloring}
from configobj import ConfigObj
config = ConfigObj()
config.filename = filename
#
config['keyword1'] = value1
config['keyword2'] = value2
#
config['section1'] = {}
config['section1']['keyword3'] = value3
config['section1']['keyword4'] = value4
#
section2 = {
'keyword5': value5,
'keyword6': value6,
'sub-section': {
'keyword7': value7
}
}
config['section2'] = section2
#
config['section3'] = {}
config['section3']['keyword 8'] = [value8, value9, value10]
config['section3']['keyword 9'] = [value11, value12, value13]
#
config.write()
{-coloring}
.. caution::
Keywords and section names can only be strings [#]_. Attempting to set
anything else will raise a ``ValueError``.
Config Files
------------
The config files that ConfigObj will read and write are based on the 'INI'
format. This means it will read and write files created for ``ConfigParser``
[#]_.
Keywords and values are separated by an ``'='``, and section markers are
between square brackets. Keywords, values, and section names can be surrounded
by single or double quotes. Indentation is not significant, but can be
preserved.
Subsections are indicated by repeating the square brackets in the section
marker. You nest levels by using more brackets.
You can have list values by separating items with a comma, and values spanning
multiple lines by using triple quotes (single or double).
For full details on all these see `the config file format`_. Here's an example
to illustrate : ::
# This is the 'initial_comment'
# Which may be several lines
keyword1 = value1
'keyword 2' = 'value 2'
[ "section 1" ]
# This comment goes with keyword 3
keyword 3 = value 3
'keyword 4' = value4, value 5, 'value 6'
[[ sub-section ]] # an inline comment
# sub-section is inside "section 1"
'keyword 5' = 'value 7'
'keyword 6' = '''A multiline value,
that spans more than one line :-)
The line breaks are included in the value.'''
[[[ sub-sub-section ]]]
# sub-sub-section is *in* 'sub-section'
# which is in 'section 1'
'keyword 7' = 'value 8'
[section 2] # an inline comment
keyword8 = "value 9"
keyword9 = value10 # an inline comment
# The 'final_comment'
# Which also may be several lines
ConfigObj specifications
========================
.. raw:: html
{+coloring}
config = ConfigObj(infile=None, options=None, **keywargs)
{-coloring}
infile
------
You don't need to specify an infile. If you omit it, an empty ConfigObj will be
created. ``infile`` *can* be :
* Nothing. In which case the ``filename`` attribute of your ConfigObj will be
``None``. You can set a filename at any time.
* A filename. What happens if the file doesn't already exist is determined by
the options_ ``file_error`` and ``create_empty``. The filename will be
preserved as the ``filename`` attribute. This can be changed at any time.
* A list of lines. Any trailing newlines will be removed from the lines. The
``filename`` attribute of your ConfigObj will be ``None``.
* A ``StringIO`` instance or file object, or any object with a ``read`` method.
The ``filename`` attribute of your ConfigObj will be ``None`` [#]_.
* A dictionary. You can initialise a ConfigObj from a dictionary [#]_. The
``filename`` attribute of your ConfigObj will be ``None``. All keys must be
strings. In this case, the order of values and sections is arbitrary.
options
-------
There are various options that control the way ConfigObj behaves. They can be
passed in as a dictionary of options, or as keyword arguments. Explicit keyword
arguments override the dictionary.
All of the options are available as attributes after the config file has been
parsed.
ConfigObj has the following options (with the default values shown) :
* 'raise_errors': ``False``
When parsing, it is possible that the config file will be badly formed. The
default is to parse the whole file and raise a single error at the end. You
can set ``raise_errors = True`` to have errors raised immediately. See the
exceptions_ section for more details.
Altering this value after initial parsing has no effect.
* 'list_values': ``True``
If ``True`` (the default) then list values are possible. If ``False``, the
values are not parsed for lists.
If ``list_values = False`` then single line values are not quoted or
unquoted when reading and writing.
Changing this value affects whether single line values will be quoted or
not when writing.
* 'create_empty': ``False``
If this value is ``True`` and the file specified by ``infile`` doesn't
exist, ConfigObj will create an empty file. This can be a useful test that
the filename makes sense: an impossible filename will cause an error.
Altering this value after initial parsing has no effect.
* 'file_error': ``False``
If this value is ``True`` and the file specified by ``infile`` doesn't
exist, ConfigObj will raise an ``IOError``.
Altering this value after initial parsing has no effect.
* 'interpolation': ``True``
Whether string interpolation is switched on or not. It is on (``True``) by
default.
You can set this attribute to change whether string interpolation is done
when values are fetched. See the `String Interpolation`_ section for more details.
* 'configspec': ``None``
If you want to use the validation system, you supply a configspec. This is
effectively a type of config file that specifies a check for each member.
This check can be used to do type conversion as well as check that the
value is within your required parameters.
You provide a configspec in the same way as you do the initial file: a
filename, or list of lines, etc. See the validation_ section for full
details on how to use the system.
When parsed, every section has a ``configspec`` with a dictionary of
configspec checks for *that section*.
* 'stringify': ``True``
If you use the validation scheme, it can do type checking *and* conversion
for you. This means you may want to set members to integers, or other
non-string values.
If 'stringify' is set to ``True`` (default) then non-string values will
be converted to strings when you write the config file. The validation_
process converts values from strings to the required type.
If 'stringify' is set to ``False``, attempting to set a member to a
non-string value [#]_ will raise a ``TypeError`` (no type conversion is
done by validation).
* 'indent_type': ``' '``
Indentation is not significant; it can however be present in the input and
output config. Any combination of tabs and spaces may be used: the string
will be repeated for each level of indentation. Typical values are: ``''``
(no indentation), ``' '`` (indentation with four spaces, the default),
``'\t'`` (indentation with one tab).
If this option is not specified, and the ConfigObj is initialised with a
dictionary, the indentation used in the output is the default one, that is,
four spaces.
If this option is not specified, and the ConfigObj is initialised with a
list of lines or a file, the indentation used in the first indented line is
selected and used in all output lines. If no input line is indented, no
output line will be either.
If this option *is* specified, the option value is used in the output
config, overriding the type of indentation in the input config (if any).
* 'encoding': ``None``
By default **ConfigObj** does not decode the file/strings you pass it into
Unicode [#]_. If you want your config file as Unicode (keys and members)
you need to provide an encoding to decode the file with. This encoding will
also be used to encode the config file when writing.
You can change the encoding attribute at any time.
Any characters in your strings that can't be encoded with the specified
encoding will raise a ``UnicodeEncodeError``.
.. note::
``UTF16`` encoded files will automatically be detected and decoded,
even if ``encoding`` is ``None``.
This is because it is a 16-bit encoding, and ConfigObj will mangle it
(split characters on byte boundaries) if it parses it without decoding.
* 'default_encoding': ``None``
When using the ``write`` method, **ConfigObj** uses the ``encoding``
attribute to encode the Unicode strings. If any members (or keys) have
been set as byte strings instead of Unicode, these must first be decoded
to Unicode before outputting in the specified encoding.
``default_encoding``, if specified, is the encoding used to decode byte
strings in the **ConfigObj** before writing. If this is ``None``, then
the Python default encoding (``sys.defaultencoding`` - usually ASCII) is
used.
For most Western European users, a value of ``latin-1`` is sensible.
``default_encoding`` is *only* used if an ``encoding`` is specified.
Any characters in byte-strings that can't be decoded using the
``default_encoding`` will raise a ``UnicodeDecodeError``.
* 'unrepr': ``False``
The ``unrepr`` option reads and writes files in a different mode. This
allows you to store and retrieve the basic Python data-types using config
files.
This uses Python syntax for lists and quoting. See `unrepr mode`_ for the
full details.
* 'write_empty_values': ``False``
If ``write_empty_values`` is ``True``, empty strings are written as
empty values. See `Empty Values`_ for more details.
Methods
-------
The ConfigObj is a subclass of an object called ``Section``, which is itself a
subclass of ``dict``, the builtin dictionary type. This means it also has
**all** the normal dictionary methods.
In addition, the following `Section Methods`_ may be useful :
* 'restore_default'
* 'restore_defaults'
* 'walk'
* 'merge'
* 'dict'
* 'as_bool'
* 'as_float'
* 'as_int'
Read about Sections_ for details of all the methods.
.. hint::
The *merge* method of sections is a recursive update.
You can use this to merge sections, or even whole ConfigObjs, into each
other.
You would typically use this to create a default ConfigObj and then merge
in user settings. This way users only need to specify values that are
different from the default. You can use configspecs and validation to
achieve the same thing of course.
The public methods available on ConfigObj are :
* 'write'
* 'validate'
* 'reset'
* 'reload'
write
~~~~~
::
write(file_object=None)
This method writes the current ConfigObj and takes a single, optional argument
[#]_.
If you pass in a file like object to the ``write`` method, the config file will
be written to this. (The only method of this object that is used is its
``write`` method, so a ``StringIO`` instance, or any other file like object
will work.)
Otherwise, the behaviour of this method depends on the ``filename`` attribute
of the ConfigObj.
``filename``
ConfigObj will write the configuration to the file specified.
``None``
``write`` returns a list of lines. (Not ``'\n'`` terminated)
First the 'initial_comment' is written, then the config file, followed by the
'final_comment'. Comment lines and inline comments are written with each
key/value.
validate
~~~~~~~~
::
validate(validator, preserve_errors=False, copy=False)
.. raw:: html
{+coloring}
# filename is the config file
# filename2 is the configspec
# (which could also be hardcoded into your program)
config = ConfigObj(filename, configspec=filename2)
#
from validate import Validator
val = Validator()
test = config.validate(val)
if test == True:
print 'Succeeded.'
{-coloring}
The validate method uses the `validate
`__ module to do the
validation.
This method validates the ConfigObj against the configspec. By doing type
conversion as well it can abstract away the config file altogether and present
the config *data* to your application (in the types it expects it to be).
If the ``configspec`` attribute of the ConfigObj is ``None``, it raises a
``ValueError``.
If the stringify_ attribute is set, this process will convert values to the
type defined in the configspec.
The validate method uses checks specified in the configspec and defined in the
``Validator`` object. It is very easy to extend.
The configspec looks like the config file, but instead of the value, you
specify the check (and any default value). See the validation_ section for
details.
.. hint::
The system of configspecs can seem confusing at first, but is actually
quite simple and powerful. For a concrete example of how to use it, you may
find this blog entry helpful :
`Transforming Values with ConfigObj `_.
The ``copy`` parameter fills in missing values from the configspec (default
values), *without* marking the values as defaults. It also causes comments to
be copied from the configspec into the config file. This allows you to use a
configspec to create default config files. (Normally default values aren't
written out by the ``write`` method.)
As of ConfigObj 4.3.0 you can also pass in a ConfigObj instance as your
configspec. This is especially useful if you need to specify the encoding of
your configspec file. When you read your configspec file, you *must* specify
``list_values=False``.
.. raw:: html
{+coloring}
from configobj import ConfigObj
configspec = ConfigObj(configspecfilename, encoding='UTF8',
list_values=False)
config = ConfigObj(filename, configspec=configspec)
{-coloring}
Return Value
############
By default, the validate method either returns ``True`` (everything passed)
or a dictionary of ``True``/``False`` representing pass/fail. The dictionary
follows the structure of the ConfigObj.
If a whole section passes then it is replaced with the value ``True``. If a
whole section fails, then it is replaced with the value ``False``.
If a value is missing, and there is no default in the check, then the check
automatically fails.
The ``validate`` method takes an optional keyword argument ``preserve_errors``.
If you set this to ``True``, instead of getting ``False`` for failed checks you
get the actual error object from the **validate** module. This usually contains
useful information about why the check failed.
See the `flatten_errors`_ function for how to turn your results dictionary into
a useful list of error messages.
Even if ``preserve_errors`` is ``True``, missing keys or sections will still be
represented by a ``False`` in the results dictionary.
Mentioning Default Values
#########################
In the check in your configspec, you can specify a default to be used - by
using the ``default`` keyword. E.g. ::
key1 = integer(0, 30, default=15)
key2 = integer(default=15)
key3 = boolean(default=True)
key4 = option('Hello', 'Goodbye', 'Not Today', default='Not Today')
If the configspec check supplies a default and the value is missing in the
config, then the default will be set in your ConfigObj. (It is still passed to
the ``Validator`` so that type conversion can be done: this means the default
value must still pass the check.)
ConfigObj keeps a record of which values come from defaults, using the
``defaults`` attribute of sections_. Any key in this list isn't written out by
the ``write`` method. If a key is set from outside (even to the same value)
then it is removed from the ``defaults`` list.
.. note:
Even if all the keys in a section are in the defaults list, the section
marker is still written out.
There is additionally a special case default value of ``None``. If you set the
default value to ``None`` and the value is missing, the value will always be
set to ``None``. As the other checks don't return ``None`` (unless you
implement your own that do), you can tell that this value came from a default
value (and was missing from the config file). It allows an easy way of
implementing optional values. Simply check (and ignore) members that are set
to ``None``.
.. note::
If stringify_ is ``False`` then ``default=None`` returns ``''`` instead of
``None``. This is because setting a value to a non-string raises an error
if stringify is unset.
The default value can be a list. See `List Values`_ for the way to do this.
Writing invalid default values is a *guaranteed* way of confusing your users.
Default values **must** pass the check.
Mentioning Repeated Sections
############################
In the configspec it is possible to cause *every* sub-section in a section to
be validated using the same configspec. You do this with a section in the
configspec called ``__many__``. Every sub-section in that section has the
``__many__`` configspec applied to it (without you having to explicitly name
them in advance).
If you define a ``__many__`` type section it must the only sub-section in that
section. Having a ``__many__`` *and* other sub-sections defined in the same
section will raise a ``RepeatSectionError``.
Your ``__many__`` section can have nested subsections, which can also include
``__many__`` type sections.
See `Repeated Sections`_ for examples.
Mentioning SimpleVal
####################
If you just want to check if all members are present, then you can use the
``SimpleVal`` object that comes with ConfigObj. It only fails members if they
are missing.
Write a configspec that has all the members you want to check for, but set
every section to ``''``.
.. raw:: html
{+coloring}
val = SimpleVal()
test = config.validate(val)
if test is True:
print 'Succeeded.'
{-coloring}
Mentioning copy Mode
####################
As discussed in `Mentioning Default Values`_, you can use a configspec to
supply default values. These are marked in the ConfigObj instance as defaults,
and *not* written out by the ``write`` mode. This means that your users only
need to supply values that are different from the defaults.
This can be inconvenient if you *do* want to write out the default values,
for example to write out a default config file.
If you set ``copy=True`` when you call validate, then no values are marked as
defaults. In addition, all comments from the configspec are copied into
your ConfigObj instance. You can then call ``write`` to create your config
file.
There is a limitation with this. In order to allow `String Interpolation`_ to work
within configspecs, ``DEFAULT`` sections are not processed by
validation; even in copy mode.
reload
~~~~~~
If a ConfigObj instance was loaded from the filesystem, then this method will reload it. It
will also reuse any configspec you supplied at instantiation (including reloading it from
the filesystem if you passed it in as a filename).
If the ConfigObj does not have a filename attribute pointing to a file, then a ``ReloadError``
will be raised.
reset
~~~~~
This method takes no arguments and doesn't return anything. It restores a ConfigObj
instance to a freshly created state.
Attributes
----------
A ConfigObj has the following attributes :
* indent_type
* interpolate
* stringify
* BOM
* initial_comment
* final_comment
* list_values
* encoding
* default_encoding
* unrepr
* write_empty_values
* newlines
.. note::
This doesn't include *comments*, *inline_comments*, *defaults*, or
*configspec*. These are actually attributes of Sections_.
It also has the following attributes as a result of parsing. They correspond to
options_ when the ConfigObj was created, but changing them has no effect.
* raise_errors
* create_empty
* file_error
interpolation
~~~~~~~~~~~~~
ConfigObj can perform string interpolation in a *similar* way to
``ConfigParser``. See the `String Interpolation`_ section for full details.
If ``interpolation`` is set to ``False``, then interpolation is *not* done when
you fetch values.
stringify
~~~~~~~~~
If this attribute is set (``True``) then the validate_ method changes the
values in the ConfigObj. These are turned back into strings when write_ is
called.
If stringify is unset (``False``) then attempting to set a value to a non
string (or a list of strings) will raise a ``TypeError``.
BOM
~~~
If the initial config file *started* with the UTF8 Unicode signature (known
slightly incorrectly as the {acro;BOM;Byte Order Mark}), or the UTF16 BOM, then
this attribute is set to ``True``. Otherwise it is ``False``.
If it is set to ``True`` when ``write`` is called then, if ``encoding`` is set
to ``None`` *or* to ``utf_8`` (and variants) a UTF BOM will be written.
For UTF16 encodings, a BOM is *always* written.
initial_comment
~~~~~~~~~~~~~~~
This is a list of lines. If the ConfigObj is created from an existing file, it
will contain any lines of comments before the start of the members.
If you create a new ConfigObj, this will be an empty list.
The write method puts these lines before it starts writing out the members.
final_comment
~~~~~~~~~~~~~
This is a list of lines. If the ConfigObj is created from an existing file, it
will contain any lines of comments after the last member.
If you create a new ConfigObj, this will be an empty list.
The ``write`` method puts these lines after it finishes writing out the
members.
list_values
~~~~~~~~~~~
This attribute is ``True`` or ``False``. If set to ``False`` then values are
not parsed for list values. In addition single line values are not unquoted.
This allows you to do your own parsing of values. It exists primarily to
support the reading of the configspec_ - but has other use cases.
For example you could use the ``LineParser`` from the
`listquote module `_
to read values for nested lists.
Single line values aren't quoted when writing - but multiline values are
handled as normal.
.. caution::
Because values aren't quoted, leading or trailing whitespace can be
lost.
This behaviour was changed in version 4.0.1.
Prior to this, single line values might have been quoted; even with
``list_values=False``. This means that files written by **ConfigObj**
*could* now be incompatible - and need the quotes removing by hand.
encoding
~~~~~~~~
This is the encoding used to encode the output, when you call ``write``. It
must be a valid encoding `recognised by Python `_.
If this value is ``None`` then no encoding is done when ``write`` is called.
default_encoding
~~~~~~~~~~~~~~~~
If encoding is set, any byte-strings in your ConfigObj instance (keys or
members) will first be decoded to Unicode using the encoding specified by the
``default_encoding`` attribute. This ensures that the output is in the encoding
specified.
If this value is ``None`` then ``sys.defaultencoding`` is used instead.
unrepr
~~~~~~
Another boolean value. If this is set, then ``repr(value)`` is used to write
values. This writes values in a slightly different way to the normal ConfigObj
file syntax.
This preserves basic Python data-types when read back in. See `unrepr mode`_
for more details.
write_empty_values
~~~~~~~~~~~~~~~~~~
Also boolean. If set, values that are an empty string (``''``) are written as
empty values. See `Empty Values`_ for more details.
newlines
~~~~~~~~
When a config file is read, ConfigObj records the type of newline separators in the
file and uses this separator when writing. It defaults to ``None``, and ConfigObj
uses the system default (``os.sep``) if write is called without newlines having
been set.
The Config File Format
======================
You saw an example config file in the `Config Files`_ section. Here is a fuller
specification of the config files used and created by ConfigObj.
The basic pattern for keywords is : ::
# comment line
# comment line
keyword = value # inline comment
Both keyword and value can optionally be surrounded in quotes. The equals sign
is the only valid divider.
Values can have comments on the lines above them, and an inline comment after
them. This, of course, is optional. See the comments_ section for details.
If a keyword or value starts or ends with whitespace, or contains a quote mark
or comma, then it should be surrounded by quotes. Quotes are not necessary if
whitespace is surrounded by non-whitespace.
Values can also be lists. Lists are comma separated. You indicate a single
member list by a trailing comma. An empty list is shown by a single comma : ::
keyword1 = value1, value2, value3
keyword2 = value1, # a single member list
keyword3 = , # an empty list
Values that contain line breaks (multi-line values) can be surrounded by triple
quotes. These can also be used if a value contains both types of quotes. List
members cannot be surrounded by triple quotes : ::
keyword1 = ''' A multi line value
on several
lines''' # with a comment
keyword2 = '''I won't be "afraid".'''
#
keyword3 = """ A multi line value
on several
lines""" # with a comment
keyword4 = """I won't be "afraid"."""
.. warning::
There is no way of safely quoting values that contain both types of triple
quotes.
A line that starts with a '#', possibly preceded by whitespace, is a comment.
New sections are indicated by a section marker line. That is the section name
in square brackets. Whitespace around the section name is ignored. The name can
be quoted with single or double quotes. The marker can have comments before it
and an inline comment after it : ::
# The First Section
[ section name 1 ] # first section
keyword1 = value1
# The Second Section
[ "section name 2" ] # second section
keyword2 = value2
Any subsections (sections that are *inside* the current section) are
designated by repeating the square brackets before and after the section name.
The number of square brackets represents the nesting level of the sub-section.
Square brackets may be separated by whitespace; such whitespace, however, will
not be present in the output config written by the ``write`` method.
Indentation is not significant, but can be preserved. See the description of
the ``indent_type`` option, in the `ConfigObj specifications`_ chapter, for the
details.
A *NestingError* will be raised if the number of the opening and the closing
brackets in a section marker is not the same, or if a sub-section's nesting
level is greater than the nesting level of it parent plus one.
In the outer section, single values can only appear before any sub-section.
Otherwise they will belong to the sub-section immediately before them. ::
# initial comment
keyword1 = value1
keyword2 = value2
[section 1]
keyword1 = value1
keyword2 = value2
[[sub-section]]
# this is in section 1
keyword1 = value1
keyword2 = value2
[[[nested section]]]
# this is in sub section
keyword1 = value1
keyword2 = value2
[[sub-section2]]
# this is in section 1 again
keyword1 = value1
keyword2 = value2
[[sub-section3]]
# this is also in section 1, indentation is misleading here
keyword1 = value1
keyword2 = value2
# final comment
When parsed, the above config file produces the following data structure :
.. raw:: html
{+coloring}
ConfigObj({
'keyword1': 'value1',
'keyword2': 'value2',
'section 1': {
'keyword1': 'value1',
'keyword2': 'value2',
'sub-section': {
'keyword1': 'value1',
'keyword2': 'value2',
'nested section': {
'keyword1': 'value1',
'keyword2': 'value2',
},
},
'sub-section2': {
'keyword1': 'value1',
'keyword2': 'value2',
},
'sub-section3': {
'keyword1': 'value1',
'keyword2': 'value2',
},
},
})
{-coloring}
Sections are ordered: note how the structure of the resulting ConfigObj is in
the same order as the original file.
.. note::
In ConfigObj 4.3.0 *empty values* became valid syntax. They are read as the
empty string. There is also an option/attribute (``write_empty_values``) to
allow the writing of these.
This is mainly to support 'legacy' config files, written from other
applications. This is documented under `Empty Values`_.
`unrepr mode`_ introduces *another* syntax variation, used for storing
basic Python datatypes in config files. {sm;:-)}
Sections
========
Every section in a ConfigObj has certain properties. The ConfigObj itself also
has these properties, because it too is a section (sometimes called the *root
section*).
``Section`` is a subclass of the standard new-class dictionary, therefore it
has **all** the methods of a normal dictionary. This means you can ``update``
and ``clear`` sections.
.. note::
You create a new section by assigning a member to be a dictionary.
The new ``Section`` is created *from* the dictionary, but isn't the same
thing as the dictionary. (So references to the dictionary you use to create
the section *aren't* references to the new section).
Note the following.
.. raw:: html
{+coloring}
config = ConfigObj()
vals = {'key1': 'value 1',
'key2': 'value 2'
}
config['vals'] = vals
config['vals'] == vals
True
config['vals'] is vals
False
{-coloring}
If you now change ``vals``, the changes won't be reflected in ``config['vals']``.
A section is ordered, following its ``scalars`` and ``sections``
attributes documented below. This means that the following dictionary
attributes return their results in order.
* '__iter__'
More commonly known as ``for member in section:``.
* '__repr__' and '__str__'
Any time you print or display the ConfigObj.
* 'items'
* 'iteritems'
* 'iterkeys'
* 'itervalues'
* 'keys'
* 'popitem'
* 'values'
Section Attributes
------------------
* main
A reference to the main ConfigObj.
* parent
A reference to the 'parent' section, the section that this section is a
member of.
On the ConfigObj this attribute is a reference to itself. You can use this
to walk up the sections, stopping when ``section.parent is section``.
* depth
The nesting level of the current section.
If you create a new ConfigObj and add sections, 1 will be added to the
depth level between sections.
* defaults
This attribute is a list of scalars that came from default values. Values
that came from defaults aren't written out by the ``write`` method.
Setting any of these values in the section removes them from the defaults
list.
* default_values
This attribute is a dictionary mapping keys to the default values for the
keys. By default it is an empty dictionary and is populated when you
validate the ConfigObj.
* scalars, sections
These attributes are normal lists, representing the order that members,
single values and subsections appear in the section. The order will either
be the order of the original config file, *or* the order that you added
members.
The order of members in this lists is the order that ``write`` creates in
the config file. The ``scalars`` list is output before the ``sections``
list.
Adding or removing members also alters these lists. You can manipulate the
lists directly to alter the order of members.
.. warning::
If you alter the ``scalars``, ``sections``, or ``defaults`` attributes
so that they no longer reflect the contents of the section, you will
break your ConfigObj.
See also the ``rename`` method.
* comments
This is a dictionary of comments associated with each member. Each entry is
a list of lines. These lines are written out before the member.
* inline_comments
This is *another* dictionary of comments associated with each member. Each
entry is a string that is put inline with the member.
* configspec
The configspec attribute is a dictionary mapping scalars to *checks*. A
check defines the expected type and possibly the allowed values for a
member.
The configspec has the same format as a config file, but instead of values
it has a specification for the value (which may include a default value).
The validate_ method uses it to check the config file makes sense. If a
configspec is passed in when the ConfigObj is created, then it is parsed
and broken up to become the ``configspec`` attribute of each section.
If you didn't pass in a configspec, this attribute will be ``None`` on the
root section (the main ConfigObj).
You can set the configspec attribute directly on a section.
See the validation_ section for full details of how to write configspecs.
Section Methods
---------------
* **dict**
This method takes no arguments. It returns a deep copy of the section as a
dictionary. All subsections will also be dictionaries, and list values will
be copies, rather than references to the original [#]_.
* **rename**
``rename(oldkey, newkey)``
This method renames a key, without affecting its position in the sequence.
It is mainly implemented for the ``encode`` and ``decode`` methods, which
provide some Unicode support.
* **merge**
``merge(indict)``
This method is a *recursive update* method. It allows you to merge two
config files together.
You would typically use this to create a default ConfigObj and then merge
in user settings. This way users only need to specify values that are
different from the default.
For example :
.. raw:: html
{+coloring}
# def_cfg contains your default config settings
# user_cfg contains the user settings
cfg = ConfigObj(def_cfg)
usr = ConfigObj(user_cfg)
#
cfg.merge(usr)
"""
cfg now contains a combination of the default settings and the user
settings.
The user settings will have overwritten any of the default ones.
"""
{-coloring}
* **walk**
This method can be used to transform values and names. See `walking a
section`_ for examples and explanation.
* **decode**
``decode(encoding)``
This method decodes names and values into Unicode objects, using the
supplied encoding.
* **encode**
``encode(encoding)``
This method is the opposite of ``decode`` {sm;:!:}.
It encodes names and values using the supplied encoding. If any of your
names/values are strings rather than Unicode, Python will have to do an
implicit decode first. (This method uses ``sys.defaultencoding`` for
implicit decodes.)
* **as_bool**
``as_bool(key)``
Returns ``True`` if the key contains a string that represents ``True``, or
is the ``True`` object.
Returns ``False`` if the key contains a string that represents ``False``,
or is the ``False`` object.
Raises a ``ValueError`` if the key contains anything else.
Strings that represent ``True`` are (not case sensitive) : ::
true, yes, on, 1
Strings that represent ``False`` are : ::
false, no, off, 0
.. note::
In ConfigObj 4.1.0, this method was called ``istrue``. That method is
now deprecated and will issue a warning when used. It will go away
in a future release.
* **as_int**
``as_int(key)``
This returns the value contained in the specified key as an integer.
It raises a ``ValueError`` if the conversion can't be done.
* **as_float**
``as_float(key)``
This returns the value contained in the specified key as a float.
It raises a ``ValueError`` if the conversion can't be done.
* **restore_default**
``restore_default(key)``
Restore (and return) the default value for the specified key.
This method will only work for a ConfigObj that was created
with a configspec and has been validated.
If there is no default value for this key, ``KeyError`` is raised.
* **restore_defaults**
``restore_defaults()``
Recursively restore default values to all members
that have them.
This method will only work for a ConfigObj that was created
with a configspec and has been validated.
It doesn't delete or modify entries without default values.
Walking a Section
-----------------
.. note::
The walk method allows you to call a function on every member/name.
.. raw:: html
{+coloring}
walk(function, raise_errors=True,
call_on_sections=False, **keywargs):
{-coloring}
``walk`` is a method of the ``Section`` object. This means it is also a method
of ConfigObj.
It walks through every member and calls a function on the keyword and value. It
walks recursively through subsections.
It returns a dictionary of all the computed values.
If the function raises an exception, the default is to propagate the error, and
stop. If ``raise_errors=False`` then it sets the return value for that keyword
to ``False`` instead, and continues. This is similar to the way validation_
works.
Your function receives the arguments ``(section, key)``. The current value is
then ``section[key]`` [#]_. Any unrecognised keyword arguments you pass to
walk, are passed on to the function.
Normally ``walk`` just recurses into subsections. If you are transforming (or
checking) names as well as values, then you want to be able to change the names
of sections. In this case set ``call_on_sections`` to ``True``. Now, on
encountering a sub-section, *first* the function is called for the *whole*
sub-section, and *then* it recurses into it's members. This means your function
must be able to handle receiving dictionaries as well as strings and lists.
If you are using the return value from ``walk`` *and* ``call_on_sections``,
note that walk discards the return value when it calls your function.
.. caution::
You can use ``walk`` to transform the names of members of a section
but you mustn't add or delete members.
Examples
--------
Examples that use the walk method are the ``encode`` and ``decode`` methods.
They both define a function and pass it to walk. Because these functions
transform names as well as values (from byte strings to Unicode) they set
``call_on_sections=True``.
To see how they do it, *read the source Luke* {sm;:cool:}.
You can use this for transforming all values in your ConfigObj. For example
you might like the nested lists from ConfigObj 3. This was provided by the
listquote_ module. You could switch off the parsing for list values
(``list_values=False``) and use listquote to parse every value.
Another thing you might want to do is use the Python escape codes in your
values. You might be *used* to using ``\n`` for line feed and ``\t`` for tab.
Obviously we'd need to decode strings that come from the config file (using the
escape codes). Before writing out we'll need to put the escape codes back in
encode.
As an example we'll write a function to use with walk, that encodes or decodes
values using the ``string-escape`` codec.
The function has to take each value and set the new value. As a bonus we'll
create one function that will do decode *or* encode depending on a keyword
argument.
We don't want to work with section names, we're only transforming values, so
we can leave ``call_on_sections`` as ``False``. This means the two datatypes we
have to handle are strings and lists, we can ignore everything else. (We'll
treat tuples as lists as well).
We're not using the return values, so it doesn't need to return anything, just
change the values if appropriate.
.. raw:: html
{+coloring}
def string_escape(section, key, encode=False):
"""
A function to encode or decode using the 'string-escape' codec.
To be passed to the walk method of a ConfigObj.
By default it decodes.
To encode, pass in the keyword argument ``encode=True``.
"""
val = section[key]
# is it a type we can work with
# NOTE: for platforms where Python > 2.2
# you can use basestring instead of (str, unicode)
if not isinstance(val, (str, unicode, list, tuple)):
# no !
return
elif isinstance(val, (str, unicode)):
# it's a string !
if not encode:
section[key] = val.decode('string-escape')
else:
section[key] = val.encode('string-escape')
else:
# it must be a list or tuple!
# we'll be lazy and create a new list
newval = []
# we'll check every member of the list
for entry in val:
if isinstance(entry, (str, unicode)):
if not encode:
newval.append(entry.decode('string-escape'))
else:
newval.append(entry.encode('string-escape'))
else:
newval.append(entry)
# done !
section[key] = newval
# assume we have a ConfigObj called ``config``
#
# To decode
config.walk(string_escape)
#
# To encode.
# Because ``walk`` doesn't recognise the ``encode`` argument
# it passes it to our function.
config.walk(string_escape, encode=True)
{-coloring}
Here's a simple example of using ``walk`` to transform names and values. One
usecase of this would be to create a *standard* config file with placeholders
for section and keynames. You can then use walk to create new config files
and change values and member names :
.. raw:: html
{+coloring}
# We use 'XXXX' as a placeholder
config = '''
XXXXkey1 = XXXXvalue1
XXXXkey2 = XXXXvalue2
XXXXkey3 = XXXXvalue3
[XXXXsection1]
XXXXkey1 = XXXXvalue1
XXXXkey2 = XXXXvalue2
XXXXkey3 = XXXXvalue3
[XXXXsection2]
XXXXkey1 = XXXXvalue1
XXXXkey2 = XXXXvalue2
XXXXkey3 = XXXXvalue3
[[XXXXsection1]]
XXXXkey1 = XXXXvalue1
XXXXkey2 = XXXXvalue2
XXXXkey3 = XXXXvalue3
'''.splitlines()
cfg = ConfigObj(config)
#
def transform(section, key):
val = section[key]
newkey = key.replace('XXXX', 'CLIENT1')
section.rename(key, newkey)
if isinstance(val, (tuple, list, dict)):
pass
else:
val = val.replace('XXXX', 'CLIENT1')
section[newkey] = val
#
cfg.walk(transform, call_on_sections=True)
print cfg
ConfigObj({'CLIENT1key1': 'CLIENT1value1', 'CLIENT1key2': 'CLIENT1value2',
'CLIENT1key3': 'CLIENT1value3',
'CLIENT1section1': {'CLIENT1key1': 'CLIENT1value1',
'CLIENT1key2': 'CLIENT1value2', 'CLIENT1key3': 'CLIENT1value3'},
'CLIENT1section2': {'CLIENT1key1': 'CLIENT1value1',
'CLIENT1key2': 'CLIENT1value2', 'CLIENT1key3': 'CLIENT1value3',
'CLIENT1section1': {'CLIENT1key1': 'CLIENT1value1',
'CLIENT1key2': 'CLIENT1value2', 'CLIENT1key3': 'CLIENT1value3'}}})
{-coloring}
Exceptions
==========
There are several places where ConfigObj may raise exceptions (other than
because of bugs).
1) If a configspec filename you pass in doesn't exist, or a config file
filename doesn't exist *and* ``file_error=True``, an ``IOError`` will be
raised.
2) If you try to set a non-string key, or a non string value when
``stringify=False``, a ``TypeError`` will be raised.
3) A badly built config file will cause parsing errors.
4) A parsing error can also occur when reading a configspec.
5) In string interpolation you can specify a value that doesn't exist, or
create circular references (recursion).
6) If you have a ``__many__`` repeated section with other section definitions
(in a configspec), a ``RepeatSectionError`` will be raised.
Number 5 (which is actually two different types of exceptions) is documented
in `String Interpolation`_.
Number 6 is explained in the validation_ section.
*This* section is about errors raised during parsing.
The base error class is ``ConfigObjError``. This is a subclass of
``SyntaxError``, so you can trap for ``SyntaxError`` without needing to
directly import any of the ConfigObj exceptions.
The following other exceptions are defined (all deriving from
``ConfigObjError``) :
* ``NestingError``
This error indicates either a mismatch in the brackets in a section marker,
or an excessive level of nesting.
* ``ParseError``
This error indicates that a line is badly written. It is neither a valid
``key = value`` line, nor a valid section marker line, nor a comment line.
* ``DuplicateError``
The keyword or section specified already exists.
* ``ConfigspecError``
An error occurred whilst parsing a configspec.
* ``UnreprError``
An error occurred when parsing a value in `unrepr mode`_.
* ``ReloadError``
``reload`` was called on a ConfigObj instance that doesn't have a valid
filename attribute.
When parsing a configspec, ConfigObj will stop on the first error it
encounters. It will raise a ``ConfigspecError``. This will have an ``error``
attribute, which is the actual error that was raised.
Behaviour when parsing a config file depends on the option ``raise_errors``.
If ConfigObj encounters an error while parsing a config file:
If ``raise_errors=True`` then ConfigObj will raise the appropriate error
and parsing will stop.
If ``raise_errors=False`` (the default) then parsing will continue to the
end and *all* errors will be collected.
If ``raise_errors`` is False and multiple errors are found a ``ConfigObjError``
is raised. The error raised has a ``config`` attribute, which is the parts of
the ConfigObj that parsed successfully. It also has an attribute ``errors``,
which is a list of *all* the errors raised. Each entry in the list is an
instance of the appropriate error type. Each one has the following attributes
(useful for delivering a sensible error message to your user) :
* ``line``: the original line that caused the error.
* ``line_number``: its number in the config file.
* ``message``: the error message that accompanied the error.
If only one error is found, then that error is re-raised. The error still has
the ``config`` and ``errors`` attributes. This means that your error handling
code can be the same whether one error is raised in parsing , or several.
It also means that in the most common case (a single error) a useful error
message will be raised.
.. note::
One wrongly written line could break the basic structure of your config
file. This could cause every line after it to flag an error, so having a
list of all the lines that caused errors may not be as useful as it sounds.
{sm;:-(}.
Validation
==========
.. hint::
The system of configspecs can seem confusing at first, but is actually
quite simple and powerful. For a concrete example of how to use it, you may
find this blog entry helpful :
`Transforming Values with ConfigObj `_.
Validation is done through a combination of the configspec_ and a ``Validator``
object. For this you need *validate.py* [#]_. See downloading_ if you don't
have a copy.
Validation can perform two different operations :
1) Check that a value meets a specification. For example, check that a value
is an integer between one and six, or is a choice from a specific set of
options.
2) It can convert the value into the type required. For example, if one of
your values is a port number, validation will turn it into an integer for
you.
So validation can act as a transparent layer between the datatypes of your
application configuration (boolean, integers, floats, etc) and the text format
of your config file.
configspec
----------
The ``validate`` method checks members against an entry in the configspec. Your
configspec therefore resembles your config file, with a check for every member.
In order to perform validation you need a ``Validator`` object. This has
several useful built-in check functions. You can also create your own custom
functions and register them with your Validator object.
Each check is the name of one of these functions, including any parameters and
keyword arguments. The configspecs look like function calls, and they map to
function calls.
The basic datatypes that an un-extended Validator can test for are :
* boolean values (True and False)
* integers (including minimum and maximum values)
* floats (including min and max)
* strings (including min and max length)
* IP addresses (v4 only)
It can also handle lists of these types and restrict a value to being one from
a set of options.
An example configspec is going to look something like : ::
port = integer(0, 100)
user = string(max=25)
mode = option('quiet', 'loud', 'silent')
You can specify default values, and also have the same configspec applied to
several sections. This is called `repeated sections`_.
For full details on writing configspecs, please refer to the `validate.py
documentation`_.
.. important::
Your configspec is read by ConfigObj in the same way as a config file.
That means you can do interpolation *within* your configspec.
In order to allow this, checks in the 'DEFAULT' section (of the root level
of your configspec) are *not* used.
If you need to specify the encoding of your configspec, then you can pass in a
ConfigObj instance as your configspec. When you read your configspec file, you
*must* specify ``list_values=False``.
.. raw:: html
{+coloring}
from configobj import ConfigObj
configspec = ConfigObj(configspecfilename, encoding='UTF8',
list_values=False)
config = ConfigObj(filename, configspec=configspec)
{-coloring}
.. _validate.py documentation: http://www.voidspace.org.uk/python/validate.html
Type Conversion
---------------
By default, validation does type conversion. This means that if you specify
``integer`` as the check, then calling validate_ will actually change the value
to an integer (so long as the check succeeds).
It also means that when you call the write_ method, the value will be converted
back into a string using the ``str`` function.
To switch this off, and leave values as strings after validation, you need to
set the stringify_ attribute to ``False``. If this is the case, attempting to
set a value to a non-string will raise an error.
Default Values
--------------
You can set a default value in your check. If the value is missing from the
config file then this value will be used instead. This means that your user
only has to supply values that differ from the defaults.
If you *don't* supply a default then for a value to be missing is an error,
and this will show in the `return value`_ from validate.
Additionally you can set the default to be ``None``. This means the value will
be set to ``None`` (the object) *whichever check is used*. (It will be set to
``''`` rather than ``None`` if stringify_ is ``False``). You can use this
to easily implement optional values in your config files. ::
port = integer(0, 100, default=80)
user = string(max=25, default=0)
mode = option('quiet', 'loud', 'silent', default='loud')
nick = string(default=None)
.. note::
Because the default goes through type conversion, it also has to pass the
check.
Note that ``default=None`` is case sensitive.
List Values
~~~~~~~~~~~
It's possible that you will want to specify a list as a default value. To avoid
confusing syntax with commas and quotes you use a list constructor to specify
that keyword arguments are lists. This includes the ``default`` value. This
makes checks look something like : ::
checkname(default=list('val1', 'val2', 'val3'))
This works with all keyword arguments, but is most useful for default values.
Repeated Sections
-----------------
Repeated sections are a way of specifying a configspec for a section that
should be applied to *all* subsections in the same section.
The easiest way of explaining this is to give an example. Suppose you have a
config file that describes a dog. That dog has various attributes, but it can
also have many fleas. You don't know in advance how many fleas there will be,
or what they will be called, but you want each flea validated against the same
configspec.
We can define a section called *fleas*. We want every flea in that section
(every sub-section) to have the same configspec applied to it. We do this by
defining a single section called ``__many__``. ::
[dog]
name = string(default=Rover)
age = float(0, 99, default=0)
[[fleas]]
[[[__many__]]]
bloodsucker = boolean(default=True)
children = integer(default=10000)
size = option(small, tiny, micro, default=tiny)
Every flea on our dog will now be validated using the ``__many__`` configspec.
If you define another sub-section in a section *as well as* a ``__many__`` then
you will get an error.
``__many__`` sections can have sub-sections, including their own ``__many__``
sub-sections. Defaults work in the normal way in repeated sections.
Copy Mode
---------
Because you can specify default values in your configspec, you can use
ConfigObj to write out default config files for your application.
However, normally values supplied from a default in a configspec are *not*
written out by the ``write`` method.
To do this, you need to specify ``copy=True`` when you call validate. As well
as not marking values as default, all the comments in the configspec file
will be copied into your ConfigObj instance.
.. raw:: html
{+coloring}
from configobj import ConfigObj
from validate import Validator
vdt = Validator()
config = ConfigObj(configspec='default.ini')
config.filename = 'new_default.ini'
config.validate(vdt, copy=True)
config.write()
{-coloring}
Validation and Interpolation
----------------------------
String interpolation and validation don't play well together. When validation
changes type it sets the value. If the value uses interpolation, then the
interpolation reference would normally be overwritten. Calling ``write`` would
then use the absolute value and the interpolation reference would be lost.
As a compromise - if the value is unchanged by validation then it is not reset.
This means strings that pass through validation unmodified will not be
overwritten. If validation changes type - the value has to be overwritten, and
any interpolation references are lost {sm;:-(}.
SimpleVal
---------
You may not need a full validation process, but still want to check if all the
expected values are present.
Provided as part of the ConfigObj module is the ``SimpleVal`` object. This has
a dummy ``test`` method that always passes.
The only reason a test will fail is if the value is missing. The return value
from ``validate`` will either be ``True``, meaning all present, or a dictionary
with ``False`` for all missing values/sections.
To use it, you still need to pass in a valid configspec when you create the
ConfigObj, but just set all the values to ``''``. Then create an instance of
``SimpleVal`` and pass it to the ``validate`` method.
As a trivial example if you had the following config file : ::
# config file for an application
port = 80
protocol = http
domain = voidspace
top_level_domain = org.uk
You would write the following configspec : ::
port = ''
protocol = ''
domain = ''
top_level_domain = ''
.. raw:: html
{+coloring}
config = Configobj(filename, configspec=configspec)
val = SimpleVal()
test = config.validate(val)
if test == True:
print 'All values present.'
elif test == False:
print 'No values present!'
else:
for entry in test:
if test[entry] == False:
print '"%s" missing.' % entry
{-coloring}
Empty values
============
Many config files from other applications allow empty values. As of version
4.3.0, ConfigObj will read these as an empty string.
A new option/attribute has been added (``write_empty_values``) to allow
ConfigObj to write empty strings as empty values.
.. raw:: html
{+coloring}
from configobj import ConfigObj
cfg = '''
key =
key2 = # a comment
'''.splitlines()
config = ConfigObj(cfg)
print config
ConfigObj({'key': '', 'key2': ''})
config.write_empty_values = True
for line in config.write():
print line
key =
key2 = # a comment
{-coloring}
unrepr mode
===========
The ``unrepr`` option allows you to store and retrieve the basic Python
data-types using config files. It has to use a slightly different syntax to
normal ConfigObj files. Unsurprisingly it uses Python syntax.
This means that lists are different (they are surrounded by square brackets),
and strings *must* be quoted.
The types that ``unrepr`` can work with are :
| strings, lists tuples
| None, True, False
| dictionaries, integers, floats
| longs and complex numbers
You can't store classes, types or instances.
``unrepr`` uses ``repr(object)`` to write out values, so it currently *doesn't*
check that you are writing valid objects. If you attempt to read an unsupported
value, ConfigObj will raise a ``configobj.UnknownType`` exception.
Values that are triple quoted cased. The triple quotes are removed *before*
converting. This means that you can use triple quotes to write dictionaries
over several lines in your config files. They won't be written like this
though.
If you are writing config files by hand, for use with ``unrepr``, you should
be aware of the following differences from normal ConfigObj syntax :
| List : ``['A List', 'With', 'Strings']``
| Strings : ``"Must be quoted."``
| Backslash : ``"The backslash must be escaped \\"``
These all follow normal Python syntax.
In unrepr mode *inline comments* are not saved. This is because lines are
parsed using the `compiler package `_
which discards comments.
String Interpolation
====================
ConfigObj allows string interpolation *similar* to the way ``ConfigParser``
or ``string.Template`` work. The value of the ``interpolation`` attribute
determines which style of interpolation you want to use. Valid values are
"ConfigParser" or "Template" (case-insensitive, so "configparser" and
"template" will also work). For backwards compatibility reasons, the value
``True`` is also a valid value for the ``interpolation`` attribute, and
will select ``ConfigParser``-style interpolation. At some undetermined point
in the future, that default *may* change to ``Template``-style interpolation.
For ``ConfigParser``-style interpolation, you specify a value to be
substituted by including ``%(name)s`` in the value.
For ``Template``-style interpolation, you specify a value to be substituted
by including ``${cl}name{cr}`` in the value. Alternately, if 'name' is a valid
Python identifier (i.e., is composed of nothing but alphanumeric characters,
plus the underscore character), then the braces are optional and the value
can be written as ``$name``.
Note that ``ConfigParser``-style interpolation and ``Template``-style
interpolation are mutually exclusive; you cannot have a configuration file
that's a mix of one or the other. Pick one and stick to it. ``Template``-style
interpolation is simpler to read and write by hand, and is recommended if
you don't have a particular reason to use ``ConfigParser``-style.
Interpolation checks first the current section to see if ``name`` is the key
to a value. ('name' is case sensitive).
If it doesn't find it, next it checks the 'DEFAULT' sub-section of the current
section.
If it still doesn't find it, it moves on to check the parent section and the
parent section's 'DEFAULT' subsection, and so on all the way up to the main
section.
If the value specified isn't found in any of these locations, then a
``MissingInterpolationOption`` error is raised (a subclass of
``ConfigObjError``).
If it is found then the returned value is also checked for substitutions. This
allows you to make up compound values (for example directory paths) that use
more than one default value. It also means it's possible to create circular
references. If there are any circular references which would cause an infinite
interpolation loop, an ``InterpolationLoopError`` is raised.
Both of these errors are subclasses of ``InterpolationError``, which is a
subclass of ``ConfigObjError``.
String interpolation and validation don't play well together. This is because
validation overwrites values - and so may erase the interpolation references.
See `Validation and Interpolation`_. (This can only happen if validation
has to *change* the value).
Comments
========
Any line that starts with a '#', possibly preceded by whitespace, is a comment.
If a config file starts with comments then these are preserved as the
initial_comment_.
If a config file ends with comments then these are preserved as the
final_comment_.
Every key or section marker may have lines of comments immediately above it.
These are saved as the ``comments`` attribute of the section. Each member is a
list of lines.
You can also have a comment inline with a value. These are saved as the
``inline_comments`` attribute of the section, with one entry per member of the
section.
Subsections (section markers in the config file) can also have comments.
See `Section Attributes`_ for more on these attributes.
These comments are all written back out by the ``write`` method.
flatten_errors
==============
::
flatten_errors(cfg, res)
Validation_ is a powerful way of checking that the values supplied by the user
make sense.
The validate_ method returns a results dictionary that represents pass or fail
for each value. This doesn't give you any information about *why* the check
failed.
``flatten_errors`` is an example function that turns a results dictionary into
a flat list, that only contains values that *failed*.
``cfg`` is the ConfigObj instance being checked, ``res`` is the results
dictionary returned by ``validate``.
It returns a list of keys that failed. Each member of the list is a tuple : ::
([list of sections...], key, result)
If ``validate`` was called with ``preserve_errors=False`` (the default)
then ``result`` will always be ``False``.
*list of sections* is a flattened list of sections that the key was found
in.
If the section was missing then key will be ``None``.
If the value (or section) was missing then ``result`` will be ``False``.
If ``validate`` was called with ``preserve_errors=True`` and a value
was present, but failed the check, then ``result`` will be the exception
object returned. You can use this as a string that describes the failure.
For example :
*The value "3" is of the wrong type*.
Example Usage
-------------
The output from ``flatten_errors`` is a list of tuples.
Here is an example of how you could present this information to the user.
.. raw:: html
{+coloring}
vtor = validate.Validator()
# ini is your config file - cs is the configspec
cfg = ConfigObj(ini, configspec=cs)
res = cfg.validate(vtor, preserve_errors=True)
for entry in flatten_errors(cfg, res):
# each entry is a tuple
section_list, key, error = entry
if key is not None:
section_list.append(key)
else:
section_list.append('[missing section]')
section_string = ', '.join(section_list)
if error == False:
error = 'Missing value or section.'
print section_string, ' = ', error
{-coloring}
ConfigObj 3
===========
ConfigObj 3 is now deprecated in favour of ConfigObj 4. I can fix bugs in
ConfigObj 3 if needed, though.
For anyone who still needs it, you can download it here: `ConfigObj 3.3.1`_
You can read the old docs at : `ConfigObj 3 Docs`_
.. _ConfigObj 3.3.1: http://www.voidspace.org.uk/cgi-bin/voidspace/downman.py?file=configobj3.zip
.. _ConfigObj 3 Docs: http://www.voidspace.org.uk/python/configobj3.html
CREDITS
=======
ConfigObj 4 is written by (and copyright) `Michael Foord`_ and
`Nicola Larosa`_.
Particularly thanks to Nicola Larosa for help on the config file spec, the
validation system and the doctests.
*validate.py* was originally written by Michael Foord and Mark Andrews.
Thanks to others for input and bugfixes.
LICENSE
=======
ConfigObj, and related files, are licensed under the BSD license. This is a
very unrestrictive license, but it comes with the usual disclaimer. This is
free software: test it, break it, just don't blame us if it eats your data !
Of course if it does, let us know and we'll fix the problem so it doesn't
happen to anyone else {sm;:-)}. ::
Copyright (c) 2004 - 2008, Michael Foord & Nicola Larosa
All rights reserved.
Redistribution and use in source and binary forms, with or without
modification, are permitted provided that the following conditions are
met:
* Redistributions of source code must retain the above copyright
notice, this list of conditions and the following disclaimer.
* Redistributions in binary form must reproduce the above
copyright notice, this list of conditions and the following
disclaimer in the documentation and/or other materials provided
with the distribution.
* Neither the name of Michael Foord nor Nicola Larosa
may be used to endorse or promote products derived from this
software without specific prior written permission.
THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
You should also be able to find a copy of this license at : `BSD License`_
.. _BSD License: http://www.voidspace.org.uk/python/license.shtml
TODO
====
Better support for configuration from multiple files, including tracking
*where* the original file came from and writing changes to the correct
file.
Make ``newline`` an option (as well as an attribute) ?
``UTF16`` encoded files, when returned as a list of lines, will have the
BOM at the start of every line. Should this be removed from all but the
first line ?
Option to set warning type for unicode decode ? (Defaults to strict).
A method to optionally remove uniform indentation from multiline values.
(do as an example of using ``walk`` - along with string-escape)
Should the results dictionary from validate be an ordered dictionary if
`odict `_ is available ?
Implement some of the sequence methods (which include slicing) from the
newer ``odict`` ?
Preserve line numbers of values (and possibly the original text of each value).
ISSUES
======
.. note::
Please file any bug reports to `Michael Foord`_ or the **ConfigObj**
`Mailing List`_.
There is currently no way to specify the encoding of a configspec file.
When using ``copy`` mode for validation, it won't copy ``DEFAULT``
sections. This is so that you *can* use interpolation in configspec
files.
``validate`` doesn't report *extra* values or sections.
You can't have a keyword with the same name as a section (in the same
section). They are both dictionary keys - so they would overlap.
ConfigObj doesn't quote and unquote values if ``list_values=False``.
This means that leading or trailing whitespace in values will be lost when
writing. (Unless you manually quote).
Interpolation checks first the current section, then the 'DEFAULT' subsection
of the current section, before moving on to the current section's parent and
so on up the tree.
Does it matter that we don't support the ':' divider, which is supported
by ``ConfigParser`` ?
String interpolation and validation don't play well together. When
validation changes type it sets the value. This will correctly fetch the
value using interpolation - but then overwrite the interpolation reference.
If the value is unchanged by validation (it's a string) - but other types
will be.
CHANGELOG
=========
This is an abbreviated changelog showing the major releases up to version 4.
From version 4 it lists all releases and changes.
2008/02/05 - Version 4.5.2
--------------------------
Distribution updated to include version 0.3.2 of validate_. This means that
``None`` as a default value win configspecs works.
2008/02/05 - Version 4.5.1
--------------------------
Distribution updated to include version 0.3.1 of validate_. This means that
Unicode configspecs now work.
2008/02/05 - Version 4.5.0
--------------------------
ConfigObj will now guarantee that files will be written terminated with a
newline.
ConfigObj will no longer attempt to import the ``validate`` module, until/unless
you call ``ConfigObj.validate`` with ``preserve_errors=True``. This makes it
faster to import.
New methods ``restore_default`` and ``restore_defaults``. ``restore_default``
resets an entry to its default value (and returns that value). ``restore_defaults``
resets all entries to their default value. It doesn't modify entries without a
default value. You must have validated a ConfigObj (which populates the
``default_values`` dictionary) before calling these methods.
BUGFIX: Proper quoting of keys, values and list values that contain hashes
(when writing). When ``list_values=False``, values containing hashes are
triple quoted.
Added the ``reload`` method. This reloads a ConfigObj from file. If the filename
attribute is not set then a ``ReloadError`` (a new exception inheriting from
``IOError``) is raised.
BUGFIX: Files are read in with 'rb' mode, so that native/non-native line endings work!
Minor efficiency improvement in ``unrepr`` mode.
Added missing docstrings for some overidden dictionary methods.
Added the ``reset`` method. This restores a ConfigObj to a freshly created state.
Removed old CHANGELOG file.
2007/02/04 - Version 4.4.0
--------------------------
Official release of 4.4.0
2006/12/17 - Version 4.3.3-alpha4
---------------------------------
By Nicola Larosa
Allowed arbitrary indentation in the ``indent_type`` parameter, removed the
``NUM_INDENT_SPACES`` and ``MAX_INTERPOL_DEPTH`` (a leftover) constants,
added indentation tests (including another docutils workaround, sigh), updated
the documentation.
By Michael Foord
Made the import of ``compiler`` conditional so that ``ConfigObj`` can be used
with `IronPython `_.
2006/12/17 - Version 4.3.3-alpha3
---------------------------------
By Nicola Larosa
Added a missing ``self.`` in the _handle_comment method and a related test,
per Sourceforge bug #1523975.
2006/12/09 - Version 4.3.3-alpha2
---------------------------------
By Nicola Larosa
Changed interpolation search strategy, based on this patch by Robin Munn:
http://sourceforge.net/mailarchive/message.php?msg_id=17125993
2006/12/09 - Version 4.3.3-alpha1
---------------------------------
By Nicola Larosa
Added Template-style interpolation, with tests, based on this patch by
Robin Munn: http://sourceforge.net/mailarchive/message.php?msg_id=17125991
(awful archives, bad Sourceforge, bad).
2006/06/04 - Version 4.3.2
--------------------------
Changed error handling, if parsing finds a single error then that error will
be re-raised. That error will still have an ``errors`` and a ``config``
attribute.
Fixed bug where '\\n' terminated files could be truncated.
Bugfix in ``unrepr`` mode, it couldn't handle '#' in values. (Thanks to
Philippe Normand for the report.)
As a consequence of this fix, ConfigObj doesn't now keep inline comments in
``unrepr`` mode. This is because the parser in the `compiler package`_
doesn't keep comments. {sm;:-)}
Error messages are now more useful. They tell you the number of parsing errors
and the line number of the first error. (In the case of multiple errors.)
Line numbers in exceptions now start at 1, not 0.
Errors in ``unrepr`` mode are now handled the same way as in the normal mode.
The errors stored will be an ``UnreprError``.
2006/04/29 - Version 4.3.1
--------------------------
Added ``validate.py`` back into ``configobj.zip``. (Thanks to Stewart
Midwinter)
Updated to `validate.py`_ 0.2.2.
Preserve tuples when calling the ``dict`` method. (Thanks to Gustavo Niemeyer.)
Changed ``__repr__`` to return a string that contains ``ConfigObj({ ... })``.
Change so that an options dictionary isn't modified by passing it to ConfigObj.
(Thanks to Artarious.)
Added ability to handle negative integers in ``unrepr``. (Thanks to Kevin
Dangoor.)
2006/03/24 - Version 4.3.0
--------------------------
Moved the tests and the CHANGELOG (etc) into a separate file. This has reduced
the size of ``configobj.py`` by about 40%.
Added the ``unrepr`` mode to reading and writing config files. Thanks to Kevin
Dangoor for this suggestion.
Empty values are now valid syntax. They are read as an empty string ``''``.
(``key =``, or ``key = # comment``.)
``validate`` now honours the order of the configspec.
Added the ``copy`` mode to validate. Thanks to Louis Cordier for this
suggestion.
Fixed bug where files written on windows could be given ``'\r\r\n'`` line
terminators.
Fixed bug where last occurring comment line could be interpreted as the
final comment if the last line isn't terminated.
Fixed bug where nested list values would be flattened when ``write`` is
called. Now sub-lists have a string representation written instead.
Deprecated ``encode`` and ``decode`` methods instead.
You can now pass in a ConfigObj instance as a configspec (remember to read
the configspec file using ``list_values=False``).
Sorted footnotes in the docs.
2006/02/16 - Version 4.2.0
--------------------------
Removed ``BOM_UTF8`` from ``__all__``.
The ``BOM`` attribute has become a boolean. (Defaults to ``False``.) It is
*only* ``True`` for the ``UTF16/UTF8`` encodings.
File like objects no longer need a ``seek`` attribute.
Full unicode support added. New options/attributes ``encoding``,
``default_encoding``.
ConfigObj no longer keeps a reference to file like objects. Instead the
``write`` method takes a file like object as an optional argument. (Which
will be used in preference of the ``filename`` attribute if that exists as
well.)
utf16 files decoded to unicode.
If ``BOM`` is ``True``, but no encoding specified, then the utf8 BOM is
written out at the start of the file. (It will normally only be ``True`` if
the utf8 BOM was found when the file was read.)
Thanks to Aaron Bentley for help and testing on the unicode issues.
File paths are *not* converted to absolute paths, relative paths will
remain relative as the ``filename`` attribute.
Fixed bug where ``final_comment`` wasn't returned if ``write`` is returning
a list of lines.
Deprecated ``istrue``, replaced it with ``as_bool``.
Added ``as_int`` and ``as_float``.
2005/12/14 - Version 4.1.0
--------------------------
Added ``merge``, a recursive update.
Added ``preserve_errors`` to ``validate`` and the ``flatten_errors``
example function.
Thanks to Matthew Brett for suggestions and helping me iron out bugs.
Fixed bug where a config file is *all* comment, the comment will now be
``initial_comment`` rather than ``final_comment``.
Validation no longer done on the 'DEFAULT' section (only in the root level).
This allows interpolation in configspecs.
Also use the new list syntax in validate_ 0.2.1. (For configspecs).
2005/12/02 - Version 4.0.2
--------------------------
Fixed bug in ``create_empty``. Thanks to Paul Jimenez for the report.
2005/11/05 - Version 4.0.1
--------------------------
Fixed bug in ``Section.walk`` when transforming names as well as values.
Added the ``istrue`` method. (Fetches the boolean equivalent of a string
value).
Fixed ``list_values=False`` - they are now only quoted/unquoted if they
are multiline values.
List values are written as ``item, item`` rather than ``item,item``.
2005/10/17 - Version 4.0.0
--------------------------
**ConfigObj 4.0.0 Final**
Fixed bug in ``setdefault``. When creating a new section with setdefault the
reference returned would be to the dictionary passed in *not* to the new
section. Bug fixed and behaviour documented.
Obscure typo/bug fixed in ``write``. Wouldn't have affected anyone though.
2005/09/09 - Version 4.0.0 beta 5
---------------------------------
Removed ``PositionError``.
Allowed quotes around keys as documented.
Fixed bug with commas in comments. (matched as a list value)
2005/09/07 - Version 4.0.0 beta 4
---------------------------------
Fixed bug in ``__delitem__``. Deleting an item no longer deletes the
``inline_comments`` attribute.
Fixed bug in initialising ConfigObj from a ConfigObj.
Changed the mailing list address.
2005/08/28 - Version 4.0.0 beta 3
---------------------------------
Interpolation is switched off before writing out files.
Fixed bug in handling ``StringIO`` instances. (Thanks to report from
Gustavo Niemeyer.)
Moved the doctests from the ``__init__`` method to a separate function.
(For the sake of IDE calltips).
2005/08/25 - Version 4.0.0 beta 2
---------------------------------
Amendments to *validate.py*.
First public release.
2005/08/21 - Version 4.0.0 beta 1
---------------------------------
Reads nested subsections to any depth.
Multiline values.
Simplified options and methods.
New list syntax.
Faster, smaller, and better parser.
Validation greatly improved. Includes:
* type conversion
* default values
* repeated sections
Improved error handling.
Plus lots of other improvements. {sm;:grin:}
2004/05/24 - Version 3.0.0
--------------------------
Several incompatible changes: another major overhaul and change. (Lots of
improvements though).
Added support for standard config files with sections. This has an entirely
new interface: each section is a dictionary of values.
Changed the update method to be called writein: update clashes with a dict
method.
Made various attributes keyword arguments, added several.
Configspecs and orderlists have changed a great deal.
Removed support for adding dictionaries: use update instead.
Now subclasses a new class called caselessDict. This should add various
dictionary methods that could have caused errors before.
It also preserves the original casing of keywords when writing them back out.
Comments are also saved using a ``caselessDict``.
Using a non-string key will now raise a ``TypeError`` rather than converting
the key.
Added an exceptions keyword for *much* better handling of errors.
Made ``creatempty=False`` the default.
Now checks indict *and* any keyword args. Keyword args take precedence over
indict.
``' ', ':', '=', ','`` and ``'\t'`` are now all valid dividers where the
keyword is unquoted.
ConfigObj now does no type checking against configspec when you set items.
delete and add methods removed (they were unnecessary).
Docs rewritten to include all this gumph and more; actually ConfigObj is
*really* easy to use.
Support for stdout was removed.
A few new methods added.
Charmap is now incorporated into ConfigObj.
2004/03/14 - Version 2.0.0 beta
-------------------------------
Re-written it to subclass dict. My first forays into inheritance and operator
overloading.
The config object now behaves like a dictionary.
I've completely broken the interface, but I don't think anyone was really
using it anyway.
This new version is much more 'classy'. {sm;:wink:}
It will also read straight from/to a filename and completely parse a config
file without you *having* to supply a config spec.
Uses listparse, so can handle nested list items as values.
No longer has getval and setval methods: use normal dictionary methods, or add
and delete.
2004/01/29 - Version 1.0.5
--------------------------
Version 1.0.5 has a couple of bugfixes as well as a couple of useful additions
over previous versions.
Since 1.0.0 the buildconfig function has been moved into this distribution,
and the methods reset, verify, getval and setval have been added.
A couple of bugs have been fixed.
Origins
-------
ConfigObj originated in a set of functions for reading config files in the
`atlantibots `_ project. The original
functions were written by Rob McNeur.
----------
Footnotes
=========
.. [#] And if you discover any bugs, let us know. We'll fix them quickly.
.. [#] If you specify a filename that doesn't exist, ConfigObj will assume you
are creating a new one. See the *create_empty* and *file_error* options_.
.. [#] They can be byte strings (*ordinary* strings) or Unicode.
.. [#] Except we don't support the RFC822 style line continuations, nor ':' as
a divider.
.. [#] This is a change in ConfigObj 4.2.0. Note that ConfigObj doesn't call
the seek method of any file like object you pass in. You may want to call
``file_object.seek(0)`` yourself, first.
.. [#] A side effect of this is that it enables you to copy a ConfigObj :
.. raw:: html
{+coloring}
# only copies members
# not attributes/comments
config2 = ConfigObj(config1)
{-coloring}
The order of values and sections will not be preserved, though.
.. [#] Other than lists of strings.
.. [#] The exception is if it detects a ``UTF16`` encoded file which it
must decode before parsing.
.. [#] The method signature shows that this method takes
two arguments. The second is the section to be written. This is because the
``write`` method is called recursively.
.. [#] The dict method doesn't actually use the deepcopy mechanism. This means
if you add nested lists (etc) to your ConfigObj, then the dictionary
returned by dict may contain some references. For all *normal* ConfigObjs
it will return a deepcopy.
.. [#] Passing ``(section, key)`` rather than ``(value, key)`` allows you to
change the value by setting ``section[key] = newval``. It also gives you
access to the *rename* method of the section.
.. [#] Minimum required version of *validate.py* 0.2.0 .
.. note::
Rendering this document with docutils also needs the
textmacros module and the PySrc CSS stuff. See
http://www.voidspace.org.uk/python/firedrop2/textmacros.shtml
.. raw:: html
.. _listquote: http://www.voidspace.org.uk/python/modules.shtml#listquote
.. _Michael Foord: http://www.voidspace.org.uk/python/weblog/index.shtml
.. _Nicola Larosa: http://www.teknico.net