Main Page

Welcome to the BitCurator wiki!

This site is the documentation portal for the BitCurator family of research projects: BitCurator NLP, BitCurator Access, and BitCurator. Looking for the latest release of the BitCurator Environment? Links to popular downloads and documentation are in the Quick Links section on the right-hand side of the page. Visit the BitCurator blog for recent news and for information about the Andrew W. Mellon Foundation grants that have supported these projects.

Content on this wiki and the BitCurator blog is Creative Commons-licensed.

The BitCurator wiki currently includes 23 pages. View statistics on users, pages, and edits.


The BitCurator wiki is a community-maintained resource. To get edit access, click on Request account. You'll be asked for your name, email address, and a brief bio (why we require this information). An admin will approve your account, typically within 24-48 hours, and you will receive an email with temporary credentials. After logging in, you'll be able to change those credentials and edit or add pages on the wiki.

Code contributions can be submitted as pull requests to the individual projects listed on the BitCurator organization profile on GitHub.

Ongoing support for previously grant-funded BitCurator projects is provided by the BitCurator Consortium. If you or your organization use BitCurator tools, please consider joining the Consortium!

Quick Links

BitCurator Environment (Current Release)

BitCurator Virtual Machine (v1.7.92)

[Torrent] [iBiblio Mirror] [MD5]

BitCurator Installation ISO (v1.7.92)

[Torrent] [iBiblio Mirror] [MD5]

Quickstart Guide: Installing and using BitCurator.
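
The [MD5] links above give a checksum for each download, which can be compared against the file you received to detect a corrupted or incomplete transfer. The following is a minimal sketch of that check, assuming Python 3 is available; the function names are illustrative and are not part of any BitCurator tool.

```python
import hashlib


def md5_of_file(path, chunk_size=1024 * 1024):
    """Return the hex MD5 digest of the file at `path`.

    The file is read in chunks so that multi-gigabyte VM or ISO
    downloads do not need to fit in memory.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes (EOF).
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, published_hex):
    """Compare a file's MD5 digest to a published hex value.

    Surrounding whitespace and letter case in the published value
    are ignored, since checksum files vary in formatting.
    """
    return md5_of_file(path) == published_hex.strip().lower()
```

To use it, call `matches_published_md5` with the path to the downloaded file and the hex string taken from the matching [MD5] link. A result of `False` suggests the file should be downloaded again. MD5 is suitable for detecting accidental corruption but not deliberate tampering.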

BitCurator Access Tools (Releases on GitHub)

Visit the bitcurator-access-webtools repo to get the software.
Quickstart Guide (bitcurator-access-webtools): Installing and using bitcurator-access-webtools.
Visit the bca-redtools repo to get the software.
Quickstart Guide (bca-redtools): Installing and using bca-redtools.

BitCurator NLP Tools

Visit the BitCurator NLP page for current info.

Online Help

BitCurator User Group: Get support and discuss issues with the community.
Screencasts and Video Tutorials: Useful screencasts on our YouTube channel.

BitCurator Projects

You can find source code and documentation associated with the BitCurator family of projects (BitCurator NLP, BitCurator Access, and BitCurator) linked on our GitHub IO page. For more detailed information on the projects (including workflows, design descriptions, research materials, and tool walkthroughs), follow the links to the relevant pages on this wiki in the sections below.

Visit the BitCurator NLP page.
Visit the BitCurator Access page.
Visit the BitCurator page.

Brief descriptions of these projects and their products are provided below.

  1. BitCurator NLP

    The BitCurator NLP project is developing software for collecting institutions to extract, analyze, and produce reports on features of interest in text extracted from born-digital materials contained in collections. The software will use existing natural language processing software libraries to identify and report on those items (such as entities and topics) likely to be relevant to ongoing preservation, information organization, and access activities.

  2. BitCurator Access

    The BitCurator Access project developed tools to help collecting institutions provide web-based and local access to born-digital materials held on disk images. This software includes BitCurator Access Webtools, a virtualized web service that simplifies access to raw and forensically packaged disk images, and BitCurator Access Redaction Tools, a toolset that streamlines redaction of targeted patterns in raw data streams.

  3. BitCurator

    The BitCurator project focused on bringing digital forensics tools and methods to libraries, archives, and museums working with born-digital materials. This project developed the BitCurator Environment, an Ubuntu-derived Linux distribution geared towards the needs of these institutions. It includes a suite of open source digital forensics and data analysis tools to help collecting institutions process born-digital materials.

    Assets, code, and documentation describing how the BitCurator Environment is built are provided in our repositories on GitHub. Visit our GitHub IO site for an overview of how these repositories are organized. The environmental assets repository includes detailed instructions describing the process of building the environment and generating the ISO.

Development, Funding, and Partners

The BitCurator development team is hosted by the School of Information and Library Science at the University of North Carolina, Chapel Hill. Grants from the Andrew W. Mellon Foundation supported the BitCurator project (a partnership between UNC SILS and the Maryland Institute for Technology in the Humanities) through September 2014, and the BitCurator Access project through September 2016. A grant from the Andrew W. Mellon Foundation currently supports the BitCurator NLP project (2016-2018).

BitCurator Consortium members provide continuing support for past projects. If you or your organization use BitCurator tools, please consider joining the Consortium!