Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding report functionality to ml feed #6

Open
jay-dhanwant-yral opened this issue Feb 21, 2025 · 3 comments
Open

Adding report functionality to ml feed #6

jay-dhanwant-yral opened this issue Feb 21, 2025 · 3 comments
Assignees

Comments

@jay-dhanwant-yral
Copy link
Contributor

jay-dhanwant-yral commented Feb 21, 2025

To create a report functionality that feeds negative signal to ml feed for serving the negatively engaged videos less

@jay-dhanwant-yral jay-dhanwant-yral self-assigned this Feb 21, 2025
@jay-dhanwant-yral
Copy link
Contributor Author

Proposed architecture

graph TB
    subgraph "Batch Processing Layer (Cloud Composer)"
        A[Video Embeddings] --> B[K-means Clustering]
        B --> C[Cluster Assignment]
        D[User Interactions] --> E[User Preference Calculator]
        
        C --> F[(Video Clusters Table)]
        E --> G[(User Preferences Table)]
        B --> H[(Cluster Metadata Table)]
    end

    subgraph "Real-time Recommendation Layer (Fly.io)"
        I[User Request] --> J[Load User Profile]
        J --> K[Candidate Generation]
        
        subgraph "Parallel Candidate Generation"
            K --> L1[Personalized Candidates]
            K --> L2[Similarity-based Candidates]
            K --> L3[Diverse Candidates]
            
            L1 --> M[Candidate Merger]
            L2 --> M
            L3 --> M
        end
        
        M --> N[Re-ranking]
        N --> O[Final Feed]
    end

    subgraph "Data Storage (BigQuery)"
        P[(Video Embeddings)]
        Q[(Watch History)]
        R[(User Interactions)]
        
        P --> A
        R --> D
        Q --> J
    end

    subgraph "Caching Layer"
        S[Offchain?]
        S -- Cache --> J
        S -- Cache --> K
    end

    %% Data flow connections
    F --> K
    G --> J
    H --> K
Loading

@siyara-m-yral
Copy link

  • Should take a week for implementation. But need to connect with Komal for backend work

@komal-sai-yral komal-sai-yral transferred this issue from yral-dapp/hot-or-not-web-leptos-ssr Mar 5, 2025
@jay-dhanwant-yral
Copy link
Contributor Author

  • Create report endpoint in ml feed server
  • Create corresponding tables and dags
  • Refactor the feed to incorporate the user level block list

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants