Welcome to the PurlDB documentation!

Getting Started


PurlDB is a database of packages. It stores package metadata and indexes of package files and archives, and it exposes API endpoints for retrieving data about these packages and for matching them against other codebases.
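As an illustration of looking up package data by Package URL (PURL), here is a minimal sketch that builds a query URL. The base URL, the packages/ path, and the purl query parameter are assumptions made for this example rather than confirmed PurlDB API details; check the API documentation of your PurlDB instance for the actual endpoints.

```python
from urllib.parse import urlencode

# Assumption: placeholder base URL, not a real PurlDB deployment.
PURLDB_API_BASE = "https://purldb.example.org/api"


def build_package_lookup_url(purl: str, base: str = PURLDB_API_BASE) -> str:
    """Return a URL that queries a hypothetical packages endpoint by PURL."""
    # Every Package URL starts with the "pkg:" scheme.
    if not purl.startswith("pkg:"):
        raise ValueError(f"not a Package URL: {purl!r}")
    # urlencode percent-encodes ":", "/" and "@" so the PURL survives as one value.
    query = urlencode({"purl": purl})
    return f"{base.rstrip('/')}/packages/?{query}"


url = build_package_lookup_url("pkg:npm/lodash@4.17.21")
print(url)
```

The function only builds the URL and performs no network request, so it can be combined with whichever HTTP client you prefer.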

PurlDB toolkit

purldb-toolkit is a command-line utility and library for working with PurlDB, its API, and various related libraries.


Matchcode provides the functionality to index archives, files, and directories for PurlDB packages, along with API endpoints that make matching available. It also includes a ScanCode.io pipeline for matching scanned codebases.

How-To Documents

How-To documents explain how to accomplish specific tasks.

Indices and tables