Automating a country’s basic data to gain time in a disaster involves many complex variables. Since 2022, MapAction, the United Nations Office for the Coordination of Humanitarian Affairs (UN OCHA) and the University of Georgia, Information Technology Outreach Services (ITOS) have been working on strengthening data quality for what are referred to as Common Operational Datasets (COD): ‘best available’ shared datasets that ensure consistency and simplify the discovery and exchange of key data among humanitarian organisations. In this article, we briefly present the methodology.
![](https://mapaction.org/wp-content/uploads/2025/01/CODAB-data-quality-1024x601.png)
Common Operational Datasets, or CODs, are authoritative reference datasets needed to support operations and decision-making for all actors in a humanitarian response. CODs are ‘best available’ datasets that ensure consistency and simplify the discovery and exchange of key data. The data is typically geo-spatially linked using a coordinate system and has unique geographic identification codes. These datasets are often derived from data collected by local authorities and international partners to ensure quality, but most vitally, local ownership. CODs can be collected on administrative boundaries, population and more.
The three main datasets available within the CODs for most countries are COD-AB (Administrative Boundaries), COD-PS (Population Statistics) and COD-HP (Humanitarian Profile). Country-specific CODs, such as the road network and health facilities may also be available. They can be easily found on HDX (Humanitarian Data Exchange) on the COD page or through the service API managed by ITOS.
Moreover, OCHA routinely evaluates the quality and availability of Common Operational Databases. The results of this analysis can be found on the COD Portal. The Portal documents:
- The quality of administrative boundary (COD-AB) layers and population statistics (COD-PS) tables, and;
- Their availability on HDX and for COD-AB as an ITOS geoservices. The portal is generally refreshed after any COD-AB or COD-PS is added or updated.
![](https://mapaction.org/wp-content/uploads/2025/01/COD2.png)
Keeping humanitarian data fresh
This project focused on the COD-AB. As shown in the picture below, each available country has different level administrative boundaries together with metadata and p-codes.
READ ALSO: Accelerating humanitarian response: Inside MapAction’s Automated Data Pipeline
The proposed methodology evaluates both the geospatial features as well as the metadata.
![](https://mapaction.org/wp-content/uploads/2025/01/COD3-1024x491.png)
The objective of this project was to create a quantitative assessment mechanism that enables the prioritization of work to update or enhance existing COD-AB datasets. The output is a quality index for each COD-AB data set based on tests for features (geographical, metadata, etc), targeted at both OCHA and ITOS teams and available to the public as an open-source tool.
The main deliverable of this project is the COD-AB Data Quality dashboard. Through this webpage hosted on FieldMaps — Humanitarian Maps & Data , the quality multi-index per country can be visualized through a dedicated dashboard, downloaded as a spreadsheet or as individual country pdf reports detailing the methodology. Moreover, the Python code is publicly available on a GitHub code repository for advanced users.
Methodology
The methodology for computing each country COD-AB quality index is based on a multi-index approach where separate scores are computed for each category, covering both geospatial features as well as metadata such as languages and p-codes.
The overall score for a given country is a value between 0% and 100% and is computed as the average value of 10 different categories. For each category, the score is computed as the proportion of layers matching a set of quality criteria. Within the dashboard, users can select which categories they wish to use when computing the overall score.
Scores computed for each country as well as a detailed country report can be found on the publicly available dashboard webpage hosted on fieldmaps.
![](https://mapaction.org/wp-content/uploads/2025/01/COD4-1024x414.png)
![](https://mapaction.org/wp-content/uploads/2025/01/COD5-1024x601.png)
Methodology description
The complete list of indicators used within each category can be found on the dedicated country reports on the project dashboard. Below is a list of descriptions for each category used in the overall score calculation.
Valid Geometry: Valid geometry is defined by having no empty geometries, only containing polygons (no points or lines), not containing self-intersecting rings, using WGS84 CRS (EPSG:4326), and a valid bounding box.
Valid Topology: Valid topology is defined as having no triangle polygons, sliver gaps or overlaps within a layer, with each polygon being fully contained within their parent.
Equal Area: Layers which all share the same area. Layers not sharing the same area may have empty areas representing water bodies whereas other layers have them filled out.
Sq. km: Layers which have an area attribute in square kilometers and value matches area calculated above using NASA EASE-Grid 2.0.
P-Codes: Layers which have all required P-Code columns (ADM2_PCODE), with no empty cells, only alphanumeric characters, starting with a valid ISO-2 code, no duplicate codes, all codes within a column having the same length, and hierarchical nesting codes.
Names: Layers which have all required name columns (ADM2_EN), with no empty cells, no duplicate rows, no double / leading / trailing spaces, no columns all uppercase / lowercase, no cells lacking alphabetic characters, and all characters matching the language code.
Languages: Layers which have at least 1 language column detected, all language codes used are valid, a romanized language is featured first, and layers don’t have more languages than their parents.
Date: Layers which have a valid date value for their source.
Valid On: Layers which have been validated on within the last 12 months.
Other: Layers which have no fields other than expected values.
READ ALSO: MapAction Anticipatory Action Sustainability Methodology
Conclusion and way forward
The main target audience for this project are both OCHA FISS, OCHA HDX and ITOS teams that are directly involved in evaluating and improving the quality of the CODs. This project was tailored to their needs and it’s hoped to streamline the initial quality process. By making both the results and methodology publicly available online, we hope other CODs stakeholders will integrate quality analysis in their decision making.
This project has also been a great opportunity for collaboration between the MapAction team and Maxym Malynowsky, humanitarian data expert responsible for, among others, fieldmaps.io. Maxym joined the project together with MapAction volunteers and played an essential role on designing, implementing and promoting this work. Within his new role as Data Engineer Advisor at the OCHA Centre for Humanitarian Data, Maxym will be able to link this work with the quality needs of HDX datasets.
MapAction will continue to be involved on the COD data quality topic by both taking part in the sector discussions with key stakeholders and also as a direct user in the context of emergency responses.
The dashboard will continue to be hosted by FieldMaps — Humanitarian Maps & Data and Maxym Malynowsky will be the main point of contact.
This work is funded by USAID’s Bureau for Humanitarian Assistance (BHA).
![](https://mapaction.org/wp-content/uploads/2025/01/BHA-logo-300x116.png)
Find out more about MapAction’s data work in another field, Anticipatory Action, in the video below.