Aim: Extraction of significant environment information during the creation, manipulation and use of digital objects (DOs).
Status: Finished
Partners responsible: Georg-August-Universität Göttingen, The University of Liverpool
License type: Apache v2 license (released on GitHub)
Language: Java
Additional information: PERICLES blog post
Broader description: The PERICLES Extraction Tool (PET) is open-source Java software (Apache 2 licensed) for extracting significant information from the environment in which digital objects are created and modified. This information supports the use and reuse of those objects, for example by improving the long-term preservation of data. PET performs most of its metadata extraction with Apache Tika, complemented by the following modules:

  • CPU specification snapshot
  • CPU usage monitoring
  • Calculate file checksum
  • Create custom executable command (file dependent)
  • Create custom executable command (file independent)
  • Directory Monitor Module
  • FQDN
  • File identification
  • File store information (java.nio.file)
  • File store information (sigar)
  • File system information snapshot
  • Google chrome opened tabs monitoring
  • Graphic System properties snapshot
  • Graphic card information module
  • Installed software snapshot
  • Java installation information snapshot
  • LSOF use monitor
  • List of network interfaces
  • Log expression grep
  • MediaInfo
  • Memory monitoring
  • Network information
  • OS X Spotlight Command module
  • Office document dependencies
  • Operating System properties snapshot
  • PDF Font dependencies
  • Posix file information monitoring
  • Process parameter
  • Process statistics monitoring
  • Regex text search
  • Screenshot module
  • System resources snapshot
  • System swap monitoring
  • TCP statistics monitoring
  • Uptime
  • Who (user, host, device, time)
  • Windows Handle monitoring daemon
  • XML xPath expression
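
To give a feel for the kind of information such modules gather, the following is a minimal Java sketch that combines three ideas from the list above: a file checksum, file store information via java.nio.file, and a few operating system and Java properties. It is not PET code and does not use PET's API. The class name EnvironmentSnapshotSketch, the method names, the property keys and the choice of SHA-256 are all assumptions made for illustration; only standard JDK calls are used.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.FileStore;
import java.nio.file.Files;
import java.nio.file.Path;
import java.security.DigestInputStream;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical illustration only: this class is not part of PET.
final class EnvironmentSnapshotSketch {

    // Computes the SHA-256 checksum of a file as a lowercase hex string.
    // The digest algorithm is an assumption; the source does not name one.
    static String sha256(Path file) throws IOException, NoSuchAlgorithmException {
        MessageDigest digest = MessageDigest.getInstance("SHA-256");
        try (InputStream in = new DigestInputStream(Files.newInputStream(file), digest)) {
            byte[] buffer = new byte[8192];
            while (in.read(buffer) != -1) {
                // Reading through the stream feeds the bytes into the digest.
            }
        }
        StringBuilder hex = new StringBuilder();
        for (byte b : digest.digest()) {
            hex.append(String.format("%02x", b));
        }
        return hex.toString();
    }

    // Collects a small snapshot of environment information for one file.
    // The keys are invented for this sketch.
    static Map<String, String> snapshot(Path file) throws IOException, NoSuchAlgorithmException {
        Map<String, String> info = new LinkedHashMap<>();
        info.put("file.checksum.sha256", sha256(file));

        // File store information through java.nio.file.
        FileStore store = Files.getFileStore(file);
        info.put("filestore.name", store.name());
        info.put("filestore.type", store.type());
        info.put("filestore.totalBytes", Long.toString(store.getTotalSpace()));
        info.put("filestore.usableBytes", Long.toString(store.getUsableSpace()));

        // Operating system and Java installation properties.
        info.put("os.name", System.getProperty("os.name"));
        info.put("os.version", System.getProperty("os.version"));
        info.put("java.version", System.getProperty("java.version"));
        return info;
    }

    public static void main(String[] args) throws Exception {
        // Use the file given on the command line, or a temporary sample file.
        Path file;
        if (args.length > 0) {
            file = Path.of(args[0]);
        } else {
            file = Files.createTempFile("snapshot-sketch", ".txt");
            Files.write(file, "sample".getBytes(StandardCharsets.UTF_8));
        }
        snapshot(file).forEach((key, value) -> System.out.println(key + " = " + value));
    }
}
```

Run with a file path as the only argument, the sketch prints one key/value pair per line. A real extraction tool would typically also record when and under which circumstances each value was captured, which this sketch leaves out.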

Related

Wiki: PERICLES FP7 Project