features/domain¶
Privacy-preserving browser domain classification. Maps eTLD+1 domains to semantic categories without storing raw URLs.
taskclf.features.domain
¶
Privacy-preserving browser domain classification.
Maps eTLD+1 domains (e.g. "github.com") to semantic categories
without storing full URLs, paths, or query strings. Only the domain
category string is persisted — never the raw domain or URL.
When no domain information is available (e.g. no aw-watcher-web
integration), the classifier falls back to "unknown" for browser
apps and "non_browser" for non-browser apps.
See docs/guide/privacy.md §3.4 for the data-handling contract.
classify_domain(domain, *, is_browser=True)
¶
Map a domain string to a privacy-safe category.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
domain
|
str | None
|
An eTLD+1 or subdomain string (e.g. |
required |
is_browser
|
bool
|
Whether the foreground app is a browser. |
True
|
Returns:
| Type | Description |
|---|---|
str
|
One of :data: |