Quickstart
This quickstart demonstrates the smallest useful SkeinRank path: dictionary → text → canonical values. It does not require the governance API, UI, PostgreSQL, Elasticsearch, or background workers.
For the full platform beta with the governance console, bindings, enrichment jobs, snapshots, and Search Playground, use the Docker beta quickstart.
1. Create a dictionary
Section titled “1. Create a dictionary”Save this as company_dictionary.json:
{ "profile_name": "infra_incidents", "profile_description": "Small infrastructure incident dictionary", "terms": [ { "canonical_value": "kubernetes", "slot": "TOOL", "aliases": ["k8s", "kube", "kuber"] }, { "canonical_value": "postgresql", "slot": "DATABASE", "aliases": ["pg", "postgres", "psql"] } ], "profile_stop_list": [], "global_stop_list": []}2. Validate the dictionary
Section titled “2. Validate the dictionary”skeinrank validate-dictionary company_dictionary.json3. Extract terms from text
Section titled “3. Extract terms from text”skeinrank extract "k8s rollout uses pg database" \ --text \ --dictionary company_dictionary.jsonExpected canonical values:
["kubernetes", "postgresql"]4. Canonicalize the text
Section titled “4. Canonicalize the text”skeinrank canonicalize "k8s rollout uses pg database" \ --text \ --dictionary company_dictionary.jsonExpected output:
kubernetes rollout uses postgresql database5. Use Python instead
Section titled “5. Use Python instead”from skeinrank import load_dictionary, extract_terms, canonicalize_text
dictionary = load_dictionary("company_dictionary.json")
result = extract_terms( "k8s rollout uses pg database", dictionary=dictionary,)print(result.canonical_values)
canonicalized = canonicalize_text( "k8s rollout uses pg database", dictionary=dictionary,)print(canonicalized.text)Why this path is the public MVP
It shows the core value without needing a full platform deployment. The UI, PostgreSQL store, and Elasticsearch enrichment workflow can be introduced after the end-to-end case is tested.