Loading...
Loading...
Use this skill whenever deciding what features to extract from raw marketplace assets — listing photos, owner-entered listing metadata, sitter wizard responses — to power item-to-item (similar listings), user-to-item (homefeed ranking), or user-to-user (mutual-fit matching) recommenders in a two-sided trust marketplace. Covers asset auditing, first-principles feature decomposition from the decision the user is making, vision-feature extraction (CLIP, room-type classification, amenity detection, aesthetic and quality scoring), listing text and metadata encoding (categoricals, multi-hot amenities, H3 geo-hashing, sentence-transformer description embeddings, structured pet triples), sitter wizard design (information-gain ordering, multiple-choice over free text, genuine skippability, hard constraint versus soft preference), derived-composition patterns for i2i / u2i / u2u (precomputed ANN shelves, multi-modal fusion, two-tower affinity, symmetric mutual-fit scoring, interpretable subscores), feature quality governance (single registry, training-serving parity, coverage and drift alarms, PII scrubbing, schema versioning), and incremental value proof (one feature at a time, ablation A/B, kill reviews, exploration slice, permanent feature-free baseline). Trigger even when the user does not explicitly say "feature engineering" but is asking how to get more signal out of listing photos, listing metadata, or the sitter onboarding wizard, or how to improve i2i / u2i / u2u quality without blindly ingesting a new model.
npx skill4agent add pproenca/dot-skills marketplace-recsys-feature-engineeringmarketplace-personalisationmarketplace-search-recsys-planning| # | Category | Prefix | Impact |
|---|---|---|---|
| 1 | Asset Audit and Inventory | | CRITICAL |
| 2 | First-Principles Feature Decomposition | | CRITICAL |
| 3 | Image Feature Extraction | | HIGH |
| 4 | Listing Text and Metadata Extraction | | HIGH |
| 5 | Sitter Wizard and Profile Extraction | | HIGH |
| 6 | Derived Similarity and Affinity | | MEDIUM-HIGH |
| 7 | Feature Quality and Governance | | MEDIUM-HIGH |
| 8 | Incremental Rollout and Value Proof | | MEDIUM |
audit-measure-coverage-before-modellingaudit-sample-every-asset-type-end-to-endaudit-verify-rights-and-privacy-before-extractionaudit-quantify-freshness-per-assetaudit-separate-raw-assets-from-derived-featuresfirstp-start-from-the-decision-not-the-algorithmfirstp-ask-what-signal-a-human-usesfirstp-tie-every-feature-to-a-specific-solutionfirstp-prefer-directly-observed-over-learnedfirstp-reject-features-you-cannot-serve-at-inferencefirstp-kill-features-a-popularity-baseline-already-capturesvision-use-clip-for-zero-shot-listing-embeddingsvision-detect-room-types-before-detecting-amenitiesvision-quantify-image-quality-separately-from-contentvision-extract-per-object-counts-not-just-presencen_bed = 4has_bed = truevision-pool-embeddings-across-a-listings-photo-setvision-fine-tune-on-your-domain-when-clip-underperformslisting-declare-categorical-fields-for-bounded-vocabularieslisting-multi-hot-encode-amenity-listslisting-hash-geo-to-hierarchies-not-raw-lat-lonlisting-embed-description-with-pretrained-sentence-encoderlisting-extract-stay-duration-shape-not-just-lengthlisting-encode-pet-requirements-as-structured-triples(species, count, special_needs)wizard-order-questions-by-information-gainwizard-prefer-multiple-choice-over-free-textwizard-make-skips-genuine-and-log-themwizard-capture-experience-as-counts-and-dateswizard-separate-hard-constraints-from-soft-preferencesderive-precompute-i2i-nearest-neighbours-offlinederive-fuse-modalities-before-item-similarityderive-use-two-tower-for-user-item-affinityderive-score-u2u-as-symmetric-mutual-fitmin(P(owner), P(sitter))derive-decompose-affinity-into-interpretable-subscoresderive-cache-user-embedding-with-short-ttlquality-version-feature-definitions-in-one-registryquality-serve-training-and-inference-from-one-storequality-gate-features-on-coverage-and-driftquality-scrub-pii-before-features-leave-secure-zonequality-freeze-feature-schemas-per-model-versionprove-ship-one-feature-at-a-timeprove-measure-lift-against-feature-ablated-variantprove-kill-features-that-dont-earn-maintenanceprove-dedicate-random-exploration-slice-to-new-featuresprove-retain-feature-free-baseline-permanentlyreferences/playbooks/discovering.mdreferences/_sections.mdgotchas.mdreferences/playbooks/discovering.mdreferences/assets/templates/_template.mdmarketplace-personalisationmarketplace-search-recsys-planningmarketplace-pre-member-personalisation| File | Description |
|---|---|
| references/_sections.md | Category definitions, impact ordering, cascade rationale |
| references/playbooks/discovering.md | End-to-end feature discovery playbook |
| gotchas.md | Accumulated feature-engineering diagnostic lessons (living) |
| assets/templates/_template.md | Template for authoring new rules |
| metadata.json | Version, discipline, authoritative references |