CLI
The skeinrank CLI is the most direct way to test a company dictionary locally.
Validate a dictionary
Section titled “Validate a dictionary”skeinrank validate-dictionary company_dictionary.jsonskeinrank validate-dictionary company_dictionary.json --jsonExtract from raw text
Section titled “Extract from raw text”skeinrank extract "k8s rollout uses pg database" \ --text \ --dictionary company_dictionary.jsonUseful flags:
--compactfor smaller JSON output--max-matchesto bound output size--context-charsto control highlight context--outputto write results to a file
Extract from a document
Section titled “Extract from a document”skeinrank extract incident-runbook.md \ --dictionary company_dictionary.jsonSupported lightweight formats include text-like files, Markdown, logs, CSV/TSV, JSON/JSONL, YAML-like files, HTML, and DOCX. PDF extraction is available when pypdf is installed by the caller.
Canonicalize text
Section titled “Canonicalize text”skeinrank canonicalize "k8s rollout uses pg database" \ --text \ --dictionary company_dictionary.jsonExtract plain document text
Section titled “Extract plain document text”skeinrank document-text incident-runbook.docx --output incident-runbook.txt