Scrubber


  • There have been several scrubbing and deidentification attempts over the years; scrubber seems to have gotten the furthest. The others are classify and release-manager. A new module, Capability::Deidentify, was created, which uses system::DeID as one possible engine. Unfortunately, the per-invocation overhead of running it is quite high, but since DeID is written in Perl, it should be amenable to being properly unilang-agentified. Scrubber also provides many keybindings for redacting text. It asserts the redacted text into the KB and can then automatically redact that text from the selected document or from a set of documents. KBFS tracks the metadata for the files.
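
The assert-then-redact workflow described above can be illustrated with a minimal sketch. This is not scrubber's actual code: the RedactionStore class, its method names, and the "[REDACTED]" mask are all hypothetical, and a plain in-memory set stands in for the KB. It shows only the general idea that text marked once can afterwards be removed from any number of documents.

```python
import re


class RedactionStore:
    """Hypothetical stand-in for the KB: remembers every string marked for redaction."""

    def __init__(self):
        self.terms = set()

    def assert_redacted(self, text):
        """Record a piece of text as redacted (ignores empty selections)."""
        text = text.strip()
        if text:
            self.terms.add(text)

    def redact(self, document, mask="[REDACTED]"):
        """Replace every recorded term in a single document with the mask."""
        if not self.terms:
            return document
        # Try longer terms first so "John Smith" is masked as a whole
        # rather than leaving " Smith" behind after masking "John".
        ordered = sorted(self.terms, key=len, reverse=True)
        pattern = re.compile("|".join(re.escape(term) for term in ordered))
        return pattern.sub(mask, document)

    def redact_all(self, documents, mask="[REDACTED]"):
        """Apply the same redactions to a set of documents (name -> text)."""
        return {name: self.redact(body, mask) for name, body in documents.items()}


if __name__ == "__main__":
    store = RedactionStore()
    store.assert_redacted("John Smith")
    docs = {"a.txt": "Call John Smith", "b.txt": "nothing here"}
    for name, body in store.redact_all(docs).items():
        print(name, "->", body)
```

Keeping the redacted terms in one shared store is what lets a single keystroke on one document propagate to the others. A real implementation would presumably persist the terms in the KB and consult KBFS metadata to decide which files to process.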