OS Open Built Up Areas is supplied as a single GeoPackage for Great Britain. GeoPackage (*.gpkg) is an open, non-proprietary, platform-independent, standards-based data format for geographic information systems (GIS), as defined by the Open Geospatial Consortium (OGC). It is designed to be a lightweight format that can contain large amounts of varied and complex data in a single, easy-to-distribute and ready- to-use file. GeoPackage is natively supported by numerous software applications.
GeoPackage offers users the following benefits:
The single file is easy to transfer and offers the end user a rich experience.
Attribute names are not limited in length, making the format user friendly.
The file size limit is large at 140 TB.
A file size limit could be imposed by the file system to which the file is written.
It supports raster, vector and database formats, making it a highly versatile solution.
It is an OGC standard.
In most cases, it is a plug-and-play format.
A comma-separated values (CSV) file is a common interchange format for spreadsheets and databases that facilitates the simplistic use of data. Each field is either textual or numeric. Within the CSV, each field is separated from the next by a comma. CSV file format is universally supported for easy ingestion into all major database products.
CSV files are designed to be opened in a database or GIS application and opening them in other software applications might corrupt the data. In particular, Excel has a row limit that is easily exceeded by large CSV files. We recommend that you load CSV files directly into a database or GIS, rather than trying to open these files in Excel.
The CSV supply has the following features:
Geometry is provided as Well-Known Text (WKT).
Header rows are included in each file.
There is one record per line in each file.
Fields are separated by commas.
Where string fields contain commas, they are delimited by double quotes.
Double quotes inside strings are escaped by doubling.
Records are terminated by carriage returns and line feeds.
Files use UTF-8 character encoding.
The names of attributes in CSV and GeoPackage format are very similar. The following table maps CSV attribute names to GeoPackage attribute names:
CSV attributes | GeoPackage attribute |
---|---|
*
fid1
gsscode2
gsscode2
relatedtogsscode3
relatedtogsscode3
name1_text
name1_text
name1_language
name1_language
name2_text
name2_text
name2_language
name2_language
areahectares
areahectares
geometry_area_m
geometry_area_m
geometry4
*5
* Indicates that the attribute is not mapped between the formats.
1 fid is an additional attribute. It is an INTEGER NOT NULL column that acts as a primary key. This attribute is a requirement of the OGC GeoPackage specification.
2 Present in the Built Up Areas Feature Type only.
3 Present in the Built Up Extents and Non-Built Up Extents Feature Types only.
4 CSV contains the Well-Known Text (WKT) data type that represents the vector geometry objects of the feature.
5 The geometry column is always the second column, but the attribute (or its value) is typically not visible in GIS software. GeoPackage is based on Well-Known Binary (WKB) as defined in ISO/IEC 13249-3:2011.
For information on how to open, use and understand a GeoPackage dataset, please refer to our Getting Started with GeoPackage guide. For further information on GeoPackage, please see the GeoPackage website.