PERICLES Extraction Tool (PET)
Partners responsible: Georg-August-Universität Göttingen, The University of Liverpool
Aim: Extraction of significant environment information during the creation, manipulation and use of DOs.
Status: Finished
License type: Apache v2 license (released on GitHub)
Additional information: Pericles blog post
Broader description: The PERICLES Extraction Tool (PET) is an open source (Apache 2 licensed) Java software for the extraction of significant information from the environment where digital objects are created and modified. This information supports object use and reuse, e.g. for a better long-term preservation of data. For the main part of the metadata extraction PET uses Apache TIKA and some other modules which are:
- CPU specification snapshot
- CPU usage monitoring
- Calculate file checksum
- Create custom executable command (file dependent)
- Create custom executable command (file independent)
- Directory Monitor Module
- FQDN
- File identification
- File store information (java.nio.file)
- File store information (sigar)
- File system information snapshot
- Google chrome opened tabs monitoring
- Graphic System properties snapshot
- Graphic card information module
- Installed software snapshot
- Java installation information snapshot
- LSOF use monitor
- List of network interfaces
- Log expression grep
- MediaInfo
- Memory monitoring
- Network information
- OS X Spotlight Command module
- Office document dependencies
- Operating System properties snapshot
- PDF Font dependencies
- Posix file information monitoring
- Process parameter
- Process statistics monitoring
- Regex text search
- Screenshot module
- System resources snapshot
- System swap monitoring
- TCP statistics monitoring
- Uptime
- Who (user, host, device, time)
- Windows Handle monitoring daemon
- XML xPath expression