
Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Reinforcement Learning and so on.
Stars: 2073

This repository contains a curated list of deep learning papers focused on industrial applications such as search, recommendation, and advertising. The papers cover various topics including embedding, matching, ranking, large models, transfer learning, and reinforcement learning.
README:
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Pre-Ranking, Ranking (CTR/CVR prediction), Post Ranking, Relevance, LLM, Reinforcement Learning and so on.
-
2013 (Google) (NIPS) [Word2vec] Distributed Representations of Words and Phrases and their Compositionality
-
2014 (KDD) [DeepWalk] DeepWalk - online learning of social representations
-
2015 (WWW) [LINE] LINE Large-scale Information Network Embedding
-
2016 (KDD) [Node2vec] node2vec - Scalable Feature Learning for Networks
-
2017 (ICLR) [GCN] Semi-supervised Classification with Graph Convolutional Networks
-
2017 (KDD) [Struc2vec] struc2vec - Learning Node Representations from Structural Identity
-
2017 (NIPS) [GraphSAGE] Inductive Representation Learning on Large Graphs
-
2018 (Alibaba) (KDD) *[Alibaba Embedding] Billion-scale Commodity Embedding for E-commerce Recommendation in Alibaba
-
2018 (ICLR) [GAT] Graph Attention Networks
-
2018 (Pinterest) (KDD) *[PinSage] Graph Convolutional Neural Networks for Web-Scale Recommender Systems
-
2018 (WSDM) [NetMF] Network embedding as matrix factorization - Unifying deepwalk, line, pte, and node2vec
-
2019 (Alibaba) (KDD) *[GATNE] Representation Learning for Attributed Multiplex Heterogeneous Network
-
1998 (Microsoft) Empirical Analysis of Predictive Algorithms for Collaborative Filtering
-
2003 (Amazon) Amazon.com recommendations - item-to-item collaborative filtering
-
2009 (Computer) [MF] Matrix factorization techniques for recommender systems
-
2013 (Microsoft) (CIKM) [DSSM] Learning deep structured semantic models for web search using clickthrough data
-
2015 (KDD) [Sceptre] Inferring Networks of Substitutable and Complementary Products
-
2016 (Google) (RecSys) **[Youtube DNN] Deep Neural Networks for YouTube Recommendations
-
2018 (Airbnb) (KDD) *[Airbnb Embedding] Real-time Personalization using Embeddings for Search Ranking at Airbnb
-
2018 (Alibaba) (KDD) * [TDM] Learning Tree-based Deep Model for Recommender Systems
-
2018 (Pinterest) (KDD) *[PinSage] Graph Convolutional Neural Networks for Web-Scale Recommender Systems
-
2019 (Alibaba) (CIKM) **[MIND] Multi-Interest Network with Dynamic Routing for Recommendation at Tmall
-
2019 (Alibaba) (CIKM) *[SDM] SDM - Sequential deep matching model for online large-scale recommender system
-
2019 (Alibaba) (NIPS) *[JTM] Joint Optimization of Tree-based Index and Deep Model for Recommender Systems
-
2019 (Amazon) (KDD) Semantic Product Search
-
2019 (Baidu) (KDD) *[MOBIUS] MOBIUS - Towards the Next Generation of Query-Ad Matching in Baidu's Sponsored Search
-
2019 (Google) (RecSys) **[Two-Tower] Sampling-Bias-Corrected Neural Modeling for Large Corpus Item Recommendations
-
2019 (Google) (WSDM) *[Top-K Off-Policy] Top-K Off-Policy Correction for a REINFORCE Recommender System
-
2019 [Tencent] (KDD) A User-Centered Concept Mining System for Query and Document Understanding at Tencent
-
2020 (Alibaba) (Arxiv) [SWING] Large Scale Product Graph Construction for Recommendation in E-commerce
-
2020 (Alibaba) (ICML) [OTM] Learning Optimal Tree Models under Beam Search
-
2020 (Alibaba) (KDD) *[ComiRec] Controllable Multi-Interest Framework for Recommendation
-
2020 (Facebook) (KDD) **[Embedding for Facebook Search] Embedding-based Retrieval in Facebook Search
-
2020 (Google) (WWW) *[MNS] Mixed Negative Sampling for Learning Two-tower Neural Networks in Recommendations
-
2020 (JD) (CIKM) *[DecGCN] Decoupled Graph Convolution Network for Inferring Substitutable and Complementary Items
-
2020 (JD) (SIGIR) [DPSR] Towards Personalized and Semantic Retrieval - An End-to-EndSolution for E-commerce Search via Embedding Learning
-
2020 (Microsoft) (Arxiv) TwinBERT - Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval
-
2021 (Alibaba) (KDD) * [MGDSPR] Embedding-based Product Retrieval in Taobao Search
-
2021 (Alibaba) (SIGIR) * [PDN] Path-based Deep Network for Candidate Item Matching in Recommenders
-
2021 (Amazon) (KDD) Extreme Multi-label Learning for Semantic Matching in Product Search
-
2021 (Baidu) (KDD) Pre-trained Language Model for Web-scale Retrieval in Baidu Search
-
2021 (Bytedance) (Arxiv) [DR] Deep Retrieval - Learning A Retrievable Structure for Large-Scale Recommendations
-
2021 (Meituan) (DLP-KDD) [DAT]A Dual Augmented Two-tower Model for Online Large-scale Recommendation
-
2022 (Alibaba) (CIKM) **[NANN] Approximate Nearest Neighbor Search under Neural Similarity Metric for Large-Scale Recommendation
-
2022 (Alibaba) (CIKM) [CLE-QR] Query Rewriting in TaoBao Search
-
2022 (Alibaba) **(CIKM) [MOPPR] Multi-Objective Personalized Product Retrieval in Taobao Search
-
2024 (Bytedance) (KDD) [Trinity] Trinity - Syncretizing Multi-:Long-Tail:Long-Term Interests All in One
-
2024 (Meta) (Arxiv) ** [GR] Actions Speak Louder than Words - Trillion-Parameter Sequential Transducers for Generative Recommendations
-
2025 (Bytedance) (Arxiv) [LongRetriever] LongRetriever - Towards Ultra-Long Sequence based Candidate Retrieval for Recommendation
-
2025 (Bytedance) (KDD) [VQ] Real-time Indexing for Large-scale Recommendation by Streaming Vector Quantization Retriever
-
2025 (JD) (KDD) [UniERF] UniERF - A Uniform Embedding-based Retrieval Framework for E-Commerce Search
-
2025 (Meta) (KDD) [MTMH] Optimizing Recall or Relevance? A Multi-Task Multi-Head Approach for Item-to-Item Retrieval in Recommendation
-
2025 (Meta) [RADAR ] RADAR - Recall Augmentation through Deferred Asynchronous Retrieval
-
2017 (Arxiv) (Meta) [FAISS] Billion-scale similarity search with GPUs
-
2020 (PAMI) [HNSW] Efficient and Robust Approximate Nearest Neighbor Search Using Hierarchical Navigable Small World Graphs
-
2021 (TPAMI) [IVF-PQ] Product Quantization for Nearest Neighbor Search
-
2017 (ICLR) [GCN] Semi-Supervised Classification with Graph Convolutional Networks
-
2018 (ICLR) [GAT] Graph Attention Networks
-
2018 (Pinterest) (KDD) [PinSage] Graph Convolutional Neural Networks for Web-Scale Recommender Systems
-
2019 (Alibaba) (KDD) [IntentGC] IntentGC - a Scalable Graph Convolution Framework Fusing Heterogeneous Information for Recommendation
-
2019 (Alibaba) (KDD) [MEIRec] Metapath-guided Heterogeneous Graph Neural Network for Intent Recommendation
-
2019 (Alibaba) (SIGIR) [GIN] Graph Intention Network for Click-through Rate Prediction in Sponsored Search
-
2020 (Alibaba) (SIGIR) [ATBRG] ATBRG - Adaptive Target-Behavior Relational Graph Network for Effective Recommendation
-
2021 (Baidu) (KDD) Pre-trained Language Model for Web-scale Retrieval in Baidu Search
-
2023 (Google) (NIPS) [TIGER] Recommender Systems with Generative Retrieval
-
2024 (Alibaba) (WWW) [BEQUE] Large Language Model based Long-tail Query Rewriting in Taobao Search
-
2024 (Kuaishou) (Arxiv) [KuaiFormer] KuaiFormer - Transformer-Based Retrieval at Kuaishou
-
2024 (Meta) (Arxiv) ** [GR] Actions Speak Louder than Words - Trillion-Parameter Sequential Transducers for Generative Recommendations
-
2024 (Meta) (Arxiv) Unifying Generative and Dense Retrieval for Sequential Recommendation
-
2025 (Kuaishou) (Arxiv)[OneRec] OneRec - Unifying Retrieve and Rank with Generative Recommender and Preference Alignment
-
2025 (Alibaba) (Arxiv) [TBGRecall] TBGRecall - A Generative Retrieval Model for E-commerce Recommendation Scenarios
-
2025 (Arxiv) (Tencent) [RARE] Real-time Ad retrieval via LLM-generative Commercial Intention for Sponsored Search Advertising
-
2025 (Baidu) (Arxiv) [COBRA] Sparse Meets Dense -Unified Generative Recommendations with Cascaded Sparse-Dense Representations
-
2025 (JD) (Arxiv) [GRAM] Generative Retrieval and Alignment Model - A New Paradigm for E-commerce Retrieval
-
2025 (Kuaishou) (Arxiv) [LARM] LLM-Alignment Live-Streaming Recommendationpdf
-
2025 (Kuaishou) (Arxiv) [LEARN] LEARN - Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
-
2025 (Meta) (Arxiv) [DRAMA] DRAMA - Diverse Augmentation from Large Language Models to Smaller Dense Retrievers
-
2025 (Meta) (Arxiv) [ROO] Request-Only Optimization for Recommendation Systems
-
2025 (Pinterest) [PinRec] PinRec - Outcome-Conditioned, Multi-Token Generative Retrieval for Industry-Scale Recommendation Systems
-
2025 (Xiaohongshu) NoteLLM-2 - Multimodal Large Representation Models for Recommendation
-
2020 (Alibaba) (DLP-KDD) [COLD] COLD - Towards the Next Generation of Pre-Ranking System
-
2023 (Alibaba) (CIKM) [COPR] COPR - Consistency-Oriented Pre-Ranking for Online Advertising
-
2023 (Alibaba) (KDD) [ASMOL] Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System
-
2025 (Tencent) (Arxiv) [HIT] HIT Model - A Hierarchical Interaction-Enhanced Two-Tower Model for Pre-Ranking Systems
-
2014 (ADKDD) (Facebook) Practical Lessons from Predicting Clicks on Ads at Facebook
-
2016 (Google) (DLRS) **[Wide & Deep] Wide & Deep Learning for Recommender Systems
-
2016 (Google) (RecSys) **[Youtube DNN] Deep Neural Networks for YouTube Recommendations
-
2018 (Alibaba) (KDD) **[DIN] Deep Interest Network for Click-Through Rate Prediction
-
2019 (Alibaba) (AAAI) **[DIEN] Deep Interest Evolution Network for Click-Through Rate Prediction
-
2019 (CIKM) ** [AutoInt] AutoInt -Automatic Feature Interaction Learning via Self-Attentive Neural Networks
-
2019 (Facebook) (Arxiv) [DLRM] (Facebook) Deep Learning Recommendation Model for Personalization and Recommendation Systems, Facebook
-
2019 (Google) (Recsys) ** [Youtube Multi-task] Recommending what video to watch next - a multitask ranking system
-
2020 (Alibaba) (Arxiv) ** [SIM] Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction
-
2020 (Alibaba) (NIPS) Neuron-level Structured Pruning using Polarization Regularizer
-
2020 (JD) (CIKM) **[DMT] Deep Multifaceted Transformers for Multi-objective Ranking in Large-Scale E-commerce Recommender Systems
-
2020 (Tencent) (Recsys) ** [PLE] Progressive Layered Extraction (PLE) - A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations
-
2021 (Alibaba) (CIKM) * [ZEUS] Self-Supervised Learning on Users’ Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce
-
2021 (Alibaba) (CIKM) [STAR] One Model to Serve All - Star Topology Adaptive Recommender for Multi-Domain CTR Prediction
-
2021 (Google) (WWW) * [DCN V2] DCN V2 - Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems
-
2022 (Alibaba) (Arxiv) ** [ETA] Efficient Long Sequential User Data Modeling for Click-Through Rate Prediction
-
2022 (Alibaba) (WSDM) Modeling Users’ Contextualized Page-wise Feedback for Click-Through Rate Prediction in E-commerce Search
-
2022 (Meta) ** (Arxiv) DHEN - A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
-
2023 (Alibaba) (Arxiv) [ESLM] Entire Space Learning Framework - Unbias Conversion Rate Prediction in Full Stages of Recommender System
-
2023 (Google) (Arxiv) On the Factory Floor - ML Engineering for Industrial-Scale Ads Recommendation Models
-
2023 (Google) ** (Arxiv) [Hiformer] Hiformer - Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
-
2023 (Kuaishou) (Arixiv) [TWIN] TWIN - TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou
-
2023 (Kuaishou) (KDD) [PEPNet] PEPNet - Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information
-
2024 (Kuaishou) (CIKM) [TWINv2] TWIN V2 - Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at Kuaishou
-
2024 (Meta) (Arxiv) ** [GR] Actions Speak Louder than Words - Trillion-Parameter Sequential Transducers for Generative Recommendations
-
2024 (Meta) ** (PMLR) [Wukong] Wukong - Towards a Scaling Law for Large-Scale Recommendation
-
2025 (Bytedance) ** (Arxiv) [LONGER] LONGER - Scaling Up Long Sequence Modeling in Industrial Recommenders
-
2025 (Bytedance) ** (Arxiv) [RankMixer] RankMixer - Scaling Up Ranking Models in Industrial Recommenders
-
2014 (ADKDD) (Facebook) Practical Lessons from Predicting Clicks on Ads at Facebook
-
2014 (TIST) Simple and scalable response prediction for display advertising
-
2023 Classifier Calibration with ROC-Regularized Isotonic Regression
-
2016 (ICLR) [GRU4Rec] Session-based Recommendations with Recurrent Neural Networks
-
2017 (Amazon) (IEEE) Two decades of recommender systems at Amazon.com
-
2019 (KDD) (Airbnb) Applying Deep Learning To Airbnb Search
-
2020 (Airbnb) (KDD) Improving Deep Learning For Airbnb Search
-
2008 (KDD) Learning Classifiers from Only Positive and Unlabeled Data
-
2014 (Criteo) (KDD) [DFM] Modeling Delayed Feedback in Display Advertising
-
2018 (Arxiv) [NoDeF] A Nonparametric Delayed Feedback Model for Conversion Rate Prediction
-
2019 (Twitter) (RecSys) Addressing Delayed Feedback for Continuous Training with Neural Networks in CTR prediction
-
2020 (AdKDD) Delayed Feedback Model with Negative Binomial Regression for Multiple Conversions
-
2020 (JD) (IJCAI) [TS-DL] An Attention-based Model for Conversion Rate Prediction with Delayed Feedback via Post-click Calibration
-
2020 (SIGIR) [DLA-DF] Dual Learning Algorithm for Delayed Conversions
-
2020 (WWW) [FSIW] A Feedback Shift Correction in Predicting Conversion Rates under Delayed Feedback
-
2021 (Alibaba) (AAAI) [ES-DFM] Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling
-
2021 (Alibaba) (AAAI) [ESDF] Delayed Feedback Modeling for the Entire Space Conversion Rate Prediction
-
2021 (Alibaba) (Arxiv) [Defer] Real Negatives Matter - Continuous Training with Real Negatives for Delayed Feedback Modeling
-
2021 (Google) (Arxiv) Handling many conversions per click in modeling delayed feedback
-
2021 (Tencent) (SIGIR) Counterfactual Reward Modification for Streaming Recommendation with Delayed Feedback
-
2022 (Alibaba) (WWW) [DEFUSE] Asymptotically Unbiased Estimation for Delayed Feedback Modeling via Label Correction
-
2020 (Alibaba) (KDD) *[Privileged Features Distillation] Privileged Features Distillation at Taobao Recommendations
-
2025 (Bytedance) (KDD) [HA-PFD] Hardness-aware Privileged Features Distillation with Latent Alignment for CVR Prediction
-
2025 (Kuaishou) [MIKD] Mutual Information-aware Knowledge Distillation for Short Video Recommendation
-
2010 (ICDM) [FM] Factorization machines
-
2013 (Google) (KDD) [LR] Ad Click Prediction - a View from the Trenches
-
2016 (Arxiv) [PNN] Product-based Neural Networks for User Response Prediction
-
2016 (Criteo) (Recsys) [FFM] Field-aware Factorization Machines for CTR Prediction
-
2016 (ECIR) [FNN] Deep Learning over Multi-field Categorical Data – A Case Study on User Response Prediction
-
2016 (KDD) [Deepintent] Deepintent - Learning attentions for online advertising with recurrent neural networks
-
2016 (Microsoft) (KDD) [Deep Crossing] Deep Crossing - Web-scale modeling without manually crafted combinatorial features
-
2017 (Google) (ADKDD) [DCN] Deep & CrossNetwork for Ad Click Predictions
-
2017 (Huawei) (IJCAI) [DeepFM] DeepFM - A Factorization-Machine based Neural Network for CTR Prediction
-
2017 (IJCAI) [AFM] Attentional Factorization Machines Learning the Weight of Feature Interactions via Attention Networks
-
2017 (SIGIR) [NFM] Neural Factorization Machines for Sparse Predictive Analytics
-
2017 (WWW) [NCF] Neural Collaborative Filtering
-
2018 (Google) (WSDM) [Latent Cross] Latent Cross Making Use of Context in Recurrent Recommender Systems
-
2018 (KDD) [xDeepFM] xDeepFM - Combining Explicit and Implicit Feature Interactions for Recommender Systems
-
2018 (TOIS) [PNN] Product-Based Neural Networks for User Response Prediction over Multi-Field Categorical Data
-
2019 (CIKM) ** [AutoInt] AutoInt - Automatic Feature Interaction Learning via Self-Attentive Neural Networks
-
2019 (Huawei) (WWW) [FGCNN] Feature Generation by Convolutional Neural Network for Click-Through Rate Prediction
-
2019 (Sina) (Arxiv) [FAT-DeepFFM] FAT-DeepFFM - Field Attentive Deep Field-aware Factorization Machine
-
2019 (Tencent) (AAAI) [IFM] Interaction-aware Factorization Machines for Recommender Systems
-
2020 (Baidu) (KDD) [CAN] Combo-Attention Network for Baidu Video Advertising
-
2021 (Google) (WWW) * [DCN V2] DCN V2 - Improved Deep & Cross Network and Practical Lessons for Web-scale Learning to Rank Systems
-
2021 (Sina) (DLP-KDD) [MaskNet] MaskNet - Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided Mask
-
2022 (Alibaba) (WSDM) * [CAN] CAN - Feature Co-Action Network for Click-Through Rate Prediction
-
2022 (Meta) ** (Arxiv) DHEN - A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
-
2023 (CIKM) * [GDCN] Towards Deeper, Lighter and Interpretable Cross Network for CTR Prediction
-
2023 (Google) ** (Arxiv) [Hiformer] Hiformer - Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
-
2023 (Sina) (CIKM) [MemoNet] MemoNet - Memorizing All Cross Features’ Representations Efficiently via Multi-Hash Codebook Network for CTR Prediction
-
2024 (Meta) ** (PMLR) [Wukong] Wukong - Towards a Scaling Law for Large-Scale Recommendation
-
2024 (LinkedIn) (KDD) [RDCN] LiRank - Industrial Large Scale Ranking Models at LinkedIn
-
2025 (Bytedance) ** (Arxiv) [RankMixer] RankMixer - Scaling Up Ranking Models in Industrial Recommenders
-
2025 (Meta) (Arxiv) [InterFormer] InterFormer - Effective Heterogeneous Interaction Learning for Click-Through Rate Prediction
-
2025 (Tencent) (Arxiv) [D-MoE] Enhancing CTR Prediction with De-correlated Expert Networks
-
2019 (CIKM) [AutoInt] AutoInt -Automatic Feature Interaction Learning via Self-Attentive Neural Networks
-
2020 (Arxiv) Scaling Laws for Neural Language Models
-
2021 (Baidu) (KDD) Pre-trained Language Model based Ranking in Baidu Search
-
2021 (Google) (Arxiv) [MLP-Mixer] MLP-Mixer - An all-MLP Architecture for Vision
-
2022 (Meta) ** (Arxiv) DHEN - A Deep and Hierarchical Ensemble Network for Large-Scale Click-Through Rate Prediction
-
2023 (Arxiv) [E4SRec] E4SRec - An Elegant Effective Efficient Extensible Solution of Large Language Models for Sequential Recommendation
-
2023 (Google) ** (Arxiv) [Hiformer] Hiformer - Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
-
2024 (Alibaba) (Arxiv) [BAHE] Breaking the Length Barrier - LLM-Enhanced CTR Prediction in Long Textual User Behaviors
-
2024 (Bytedance) (Arxiv) [HLLM] HLLM - Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling
-
2024 (Google) (Arxiv) LLMs for User Interest Exploration in Large-scale Recommendation Systems
-
2024 (Google) (Arxiv) [CALRec] CALRec - Contrastive Alignment of Generative LLMs for Sequential Recommendation
-
2024 (Google) (ICLR) From Sparse to Soft Mixtures of Experts
-
2024 (Kuaishou) (Arxiv) [LEARN] LEARN - Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
-
2024 (Kuaishou) (KDD) [NAR4Rec] Non-autoregressive Generative Models for Reranking Recommendation
-
2024 (Meituan) (Arxiv) [SRP4CTR] Enhancing CTR Prediction through Sequential Recommendation Pre-training - Introducing the SRP4CTR Framework
-
2024 (Meta) (Arxiv) ** [GR] Actions Speak Louder than Words - Trillion-Parameter Sequential Transducers for Generative Recommendations
-
2024 (Meta) (Arxiv) Unifying Generative and Dense Retrieval for Sequential Recommendation
-
2024 (Meta) (Arxiv) [SUM] Scaling User Modeling - Large-scale Online User Representations for Ads Personalization in Meta
-
2024 (Meta) ** (PMLR) [Wukong] Wukong - Towards a Scaling Law for Large-Scale Recommendation
-
2025 (Kuaishou) (Arxiv)[OneRec] OneRec - Unifying Retrieve and Rank with Generative Recommender and Preference Alignment
-
2025 (Alibaba) (Arxiv) Unlocking Scaling Law in Industrial Recommendation Systems with a Three-step Paradigm based Large User Model
-
2025 (Alibaba) (Arxiv) [HeterRec] Hierarchical Causal Transformer with Heterogeneous Information for Expandable Sequential Recommendation
-
2025 (Alibaba) (Arxiv) [LREA] Efficient Long Sequential Low-rank Adaptive Attention for Click-through rate Prediction
-
2025 (Alibaba) (Arxiv) [URM] Large Language Models Are Universal Recommendation Learners
-
2025 (Alibaba) (KDD) [GPSD] Scaling Transformers for Discriminative Recommendation via Generative Pretraining
-
2025 (Alibaba) (WWW) Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning
-
2025 (Arxiv) (Pinterest) [PinRec] PinRec - Outcome-Conditioned, Multi-Token Generative Retrieval for Industry-Scale Recommendation Systems
-
2025 (Arxiv) (Xiaohongshu) [GenRank] Towards Large-scale Generative Ranking
-
2025 (Baidu) (Arxiv) [COBRA] Sparse Meets Dense -Unified Generative Recommendations with Cascaded Sparse-Dense Representations
-
2025 (Bytedance) ** (Arxiv) [LONGER] LONGER - Scaling Up Long Sequence Modeling in Industrial Recommenders
-
2025 (Bytedance) ** (Arxiv) [RankMixer] RankMixer - Scaling Up Ranking Models in Industrial Recommenders
-
2025 (Google) (Arxiv) User Feedback Alignment for LLM-powered Exploration in Large-scale Recommendation Systems
-
2025 (Google) (Arxiv) [STAR] STAR - A Simple Training-free Approach for Recommendations using Large Language Models
-
2025 (Google) ** (Arxiv) [Hiformer] Hiformer - Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
-
2025 (Kuaishou) (Arxiv) [GenSAR] Unified Generative Search and Recommendation
-
2025 (Kuaishou) (Arxiv) [LARM] LLM-Alignment Live-Streaming Recommendationpdf
-
2025 (Kuaishou) (Arxiv) [LEARN] LEARN - Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application
-
2025 (Kuaishou) (Arxiv) [OneRec-V2] OneRec Technical Report v2
-
2025 (Kuaishou) (Arxiv) [OneRec] OneRec - Unifying Retrieve and Rank with Generative Recommender and Preference Alignment
-
2025 (Kuaishou) (Arxiv) [OneRec] OneRec Technical Report
-
2025 (Kuaishou) (Arxiv) [OneSearch] OneSearch - A Preliminary Exploration of the Unified End-to-End Generative Framework for E-commerce Search
-
2025 (Kuaishou) (Arxiv) [OneSug] OneSug - The Unified End-to-End Generative Framework for E-commerce Query Suggestion
-
2025 (Meituan) (Arxiv) [DFGR] Action is All You Need - Dual-Flow Generative Ranking Network for Recommendation
-
2025 (Meituan) (Arxiv) [MTGR] MTGR - Industrial-Scale Generative Recommendation Framework in Meituan
-
2025 (Meituan) (Arxiv) [UniROM] One Model to Rank Them All - Unifying Online Advertising with End-to-End Learning
-
2025 (Meta) (Arxiv) [HyperCast] Realizing Scaling Laws in Recommender Systems - A Foundation–Expert Paradigm for Hyperscale Model Deployment
-
2025 (Microsoft) (KDD)Towards Web-scale Recommendations with LLMs - From Quality-aware Ranking to Candidate Generation
-
2025 (Pinterest) (Arxiv) [PinFM] PinFM - Foundation Model for User Activity Sequences at a Billion-scale Visual Discovery Platform
-
2025 (eBay) (Arxiv) LLMDistill4Ads - Using Cross-Encoders to Distill from LLM Signals for Advertiser Keyphrase Recommendations at eBay
-
2015 (Twitter) (KDD) Click-through Prediction for Advertising in Twitter Timeline
-
2022 (Google) (KDD) Scale Calibration of Deep Ranking Models
-
2023 (Alibaba) (KDD) Joint Optimization of Ranking and Calibration with Contextualized Hybrid Model
-
2023 (Google) (CIKM) Regression Compatible Listwise Objectives for Calibrated Ranking with Binary Relevance
-
2024 (Tencent) (KDD) Understanding the Ranking Loss for Recommendation with Sparse User Feedback
-
2018 (Alibaba) (CIKM) [Image CTR] Image Matters - Visually Modeling User Behaviors Using Advanced Model Server
-
2024 (Alibaba) (CIKM) Enhancing Taobao Display Advertising with Multimodal Representations - Challenges, Approaches and Insights
-
2014 (TASLP) * [LHUC] Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation
-
2015 (Microsoft) (WWW) A Multi-View Deep Learning Approach for Cross Domain User Modeling in Recommendation Systems
-
2018 (Google) (KDD) ** [MMoE] Modeling task relationships in multi-task learning with multi-gate mixture-of-experts
-
2019 (Alibaba) (CIKM) [WE-CAN] Cross-domain Attention Network with Wasserstein Regularizers for E-commerce Search
-
2020 (Alibaba) (Arxiv) [SAML] Scenario-aware and Mutual-based approach for Multi-scenario Recommendation in E-Commerce
-
2020 (Alibaba) (CIKM) [HMoE] Improving Multi-Scenario Learning to Rank in E-commerce by Exploiting Task Relationships in the Label Space
-
2020 (Alibaba)(CIKM) [MiNet] MiNet - Mixed Interest Network for Cross-Domain Click-Through Rate Prediction
-
2020 (Tencent) (Recsys) ** [PLE] Progressive Layered Extraction (PLE) - A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations
-
2021 (Alibaba) (CIKM) * [ZEUS] Self-Supervised Learning on Users’ Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce
-
2021 (Alibaba) (CIKM) ** [STAR] One Model to Serve All - Star Topology Adaptive Recommender for Multi-Domain CTR Prediction
-
2021 (Google) (ICLR) HyperGrid Transformers - Towards A Single Model for Multiple Tasks
-
2021 (Kwai) (Arxiv) [POSO] POSO - Personalized Cold Start Modules for Large-scale Recommender Systems
-
2022 (Alibaba) (CIKM) AdaSparse - Learning Adaptively Sparse Structures for Multi-Domain Click-Through Rate Prediction
-
2022 (Alibaba) (NIPS) ** [APG] APG - Adaptive Parameter Generation Network for Click-Through Rate Prediction
-
2023 (Alibaba) (CIKM) [HC2] Hybrid Contrastive Constraints for Multi-Scenario Ad Ranking
-
2023 (Alibaba) (CIKM) [MMN] Masked Multi-Domain Network - Multi-Type and Multi-Scenario Conversion Rate Prediction with a Single Model
-
2023 (Alibaba) (CIKM) [Rec4Ad] Rec4Ad - A Free Lunch to Mitigate Sample Selection Bias for Ads CTR Prediction in Taobao
-
2023 (Alibaba) (SIGIR) [MARIA] Multi-Scenario Ranking with Adaptive Feature Learning
-
2023 (CIKM) [HAMUR] HAMUR - Hyper Adapter for Multi-Domain Recommendation
-
2023 (Huawei) (CIKM) [DFFM] DFFM - Domain Facilitated Feature Modeling for CTR Prediction
-
2023 (Kuaishou) (KDD) * [PEPNet] PEPNet - Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information
-
2023 (Tencent) (KDD) Scenario-Adaptive Feature Interaction for Click-Through Rate Prediction
-
2024 (Alibaba) (CIKM) * [MultiLoRA] MultiLoRA - Multi-Directional Low-Rank Adaptation for Multi-Domain Recommendation
-
2024 (Alibaba) (RecSys) * [MLoRA] MLoRA - Multi-Domain Low-Rank Adaptive Network for Click-Through Rate Prediction
-
2024 (Kuaishou) (SIGIR) [M3oE] M3oE - Multi-Domain Multi-Task Mixture-of-Experts Recommendation Framework
-
2024 (Tencent) (KDD) [LCN] Cross-Domain LifeLong Sequential Modeling for Online Click-Through Rate Prediction
-
2024 (WSDM) Exploring Adapter-based Transfer Learning for Recommender Systems - Empirical Studies and Practical Insights
-
2025 (Kuaishou) (KDD) [HoME] HoME - Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou
-
(2018) (ICML) GradNorm - Gradient Normalization for Adaptive Loss Balancing in Deep Multitask Networks
-
2014 (TASLP) [LHUC] Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation
-
2017 (Google) (ICLR) [Sparsely-Gated MOE] Outrageously large neural networks - The sparsely-gated mixture-of-experts layer
-
2018 (Alibaba) (KDD) [DUPN] Perceive Your Users in Depth - Learning Universal User Representations from Multiple E-commerce Tasks
-
2018 (Alibaba) (SIGIR) [ESMM] Entire Space Multi-Task Model - An Effective Approach for Estimating Post-Click Conversion Rate
-
2018 (CVPR) Multi-Task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics
-
2018 (Google) (KDD) ** [MMoE] Modeling task relationships in multi-task learning with multi-gate mixture-of-experts
-
2019 (Alibaba) (CIKM) Multi-task based Sales Predictions for Online Promotions
-
2019 (Alibaba) (Recys) A Pareto-Eficient Algorithm for Multiple Objective Optimization in E-Commerce Recommendation
-
2019 (Google) (AAAI) SNR Sub-Network Routing for Flexible Parameter Sharing in Multi-Task Learning
-
2019 (Google) (Recsys) ** [Youtube Multi-task] Recommending what video to watch next - a multitask ranking system
-
2019 (NIPS) Pareto Multi-Task Learning
-
2020 (Alibaba) (SIGIR) [ESM2] Entire Space Multi-Task Modeling via Post-Click Behavior Decomposition for Conversion Rate Prediction
-
2020 (Alibaba) (WWW) Large-scale Causal Approaches to Debiasing Post-click Conversion Rate Estimation with Multi-task Learning
-
2020 (Amazon) (WWW) Multi-Objective Ranking Optimization for Product Search Using Stochastic Label Aggregation
-
2020 (Google) (KDD) [MoSE] Multitask Mixture of Sequential Experts for User Activity Streams
-
2020 (JD) (CIKM) *[DMT] Deep Multifaceted Transformers for Multi-objective Ranking in Large-Scale E-commerce Recommender Systems
-
2020 (Tencent) (Recsys) ** [PLE] Progressive Layered Extraction (PLE) - A Novel Multi-Task Learning (MTL) Model for Personalized Recommendations
-
2021 (Alibaba) (SIGIR) [HM3] Hierarchically Modeling Micro and Macro Behaviors via Multi-Task Learning for Conversion Rate Prediction
-
2021 (Alibaba) (SIGIR) [MSSM] MSSM - A Multiple-level Sparse Sharing Model for Efficient Multi-Task Learning
-
2021 (Baidu) (SIGIR) [GemNN] GemNN - Gating-Enhanced Multi-Task Neural Networks with Feature Interaction Learning for CTR Prediction
-
2021 (Google) (Arxiv) [DSelect-k] DSelect-k Differentiable Selection in the Mixture of Experts with Applications to Multi-Task Learning
-
2021 (Google) (KDD) Understanding and Improving Fairness-Accuracy Trade-offs in Multi-Task Learning
-
2021 (JD) (ICDE) Adversarial Mixture Of Experts with Category Hierarchy Soft Constraint
-
2021 (Meituan) (KDD) Modeling the Sequential Dependence among Audience Multi-step Conversions with Multi-task Learning in Targeted Display Advertising
-
2021 (Tencent) (Arxiv) Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling
-
2021 (Tencent) (WWW) Personalized Approximate Pareto-Efficient Recommendation
-
2022 (Google) (WWW) Can Small Heads Help? Understanding and Improving Multi-Task Generalization
-
2023 (Airbnb) (KDD) Optimizing Airbnb Search Journey with Multi-task Learning
-
2023 (Alibaba) (CIKM) [DTRN] Deep Task-specific Bottom Representation Network for Multi-Task Recommendation
-
2023 (Google) (CIKM) Multitask Ranking System for Immersive Feed and No More Clicks - A Case Study of Short-Form Video Recommendation
-
2023 (Google) (KDD) Improving Training Stability for Multitask Ranking Models in Recommender Systems
-
2023 (Meta) (KDD) AdaTT - Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations
-
2024 (Airbnb) (KDD) Multi-objective Learning to Rank by Model Distillation
-
2024 (Kuaishou) (KDD) [GradCraft] GradCraft - Elevating Multi-task Recommendations through Holistic Gradient Crafting
-
2024 (Kuaishou) [HoME] HoME - Hierarchy of Multi-Gate Experts for Multi-Task Learning at Kuaishou
-
2024 (Shopee) (KDD) [ResFlow] Residual Multi-Task Learner for Applied Ranking
-
2024 (Tencent) (KDD) [STEM] Ads Recommendation in a Collapsed and Entangled World
-
2025 (Baidu) (KDD) [RankExpert] RankExpert - A Mixture of Textual-and-Behavioral Experts for Multi-Objective Learning-to-Rank in Web Search
-
2025 (Alibaba) (CIKM) [MAL] See Beyond a Single View - Multi-Attribution Learning Leads to Better Conversion Rate Prediction
-
2014 (Baidu) (OSDI) Scaling Distributed Machine Learning with the Parameter Server
-
2019 (Alibaba) (DLP-KDD) [XDL] XDL - An Industrial Deep Learning Framework for High-dimensional Sparse Data
-
2020 (Bytedance) (OSDI) [BytePS] A Unified Architecture for Accelerating Distributed DNN Training in Heterogeneous GPU:CPU Clusters
-
2022 (Kuaishou) (KDD) [Persia] Persia - An Open, Hybrid System Scaling Deep Learning-based Recommenders up to 100 Trillion Parameters
-
2014 (TASLP) * [LHUC] Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation
-
2018 (CVPR) * [SENet] Squeeze-and-Excitation Networks
-
2019 (Sina) (Recsys) [FiBiNET] FiBiNET - combining feature importance and bilinear feature interaction for click-through rate prediction
-
2020 (Sina) (Arxiv) [GateNet] GateNet - Gating-Enhanced Deep Network for Click-Through Rate Prediction
-
2023 (Kuaishou) (KDD) [PEPNet] PEPNet - Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information
-
2023 (Sina) (CIKM) [FiBiNet++] FiBiNet++ - Reducing Model Size by Low Rank Feature Interaction Layer for CTR Prediction
-
2019 (Alibaba) (IJCAI) [DeepMCP] Representation Learning-Assisted Click-Through Rate Prediction
-
2019 (SIGIR) [BERT4Rec] (Alibaba) (SIGIR2019) BERT4Rec - Sequential Recommendation with Bidirectional Encoder Representations from Transformer
-
2016 (Google) (RecSys) **[Youtube DNN] Deep Neural Networks for YouTube Recommendations
-
2017 (Google) (NIPS) ** Attention Is All You Need
-
2018 (Alibaba) (KDD) **[DIN] Deep Interest Network for Click-Through Rate Prediction
-
2018 (Alibaba) (KDD) [DUPN] Perceive Your Users in Depth - Learning Universal User Representations from Multiple E-commerce Tasks
-
2019 (Alibaba) (AAAI) **[DIEN] Deep Interest Evolution Network for Click-Through Rate Prediction
-
2019 (Alibaba) (IJCAI) [DSIN] Deep Session Interest Network for Click-Through Rate Prediction
-
2019 (Alibaba) (KDD) [BST] Behavior Sequence Transformer for E-commerce Recommendation in Alibaba
-
2019 (Alibaba) (KDD) [DSTN] Deep Spatio-Temporal Neural Networks for Click-Through Rate Prediction
-
2019 (Alibaba) (WWW) [TiSSA] TiSSA - A Time Slice Self-Attention Approach for Modeling Sequential User Behaviors
-
2019 (Tencent) (KDD) [RALM] TReal-time Attention Based Look-alike Model for Recommender System
-
2020 (Alibaba) (SIGIR) [DHAN] Deep Interest with Hierarchical Attention Network for Click-Through Rate Prediction
-
2020 (Google) (KDD) [Google Drive] Improving Recommendation Quality in Google Drive
-
2020 (JD) (CIKM) **[DMT] Deep Multifaceted Transformers for Multi-objective Ranking in Large-Scale E-commerce Recommender Systems
-
2020 (JD) (NIPS) [KFAtt] Kalman Filtering Attention for User Behavior Modeling in CTR Prediction
-
2020 (JD) (WSDM) [HUP] Hierarchical User Profiling for E-commerce Recommender Systems
-
2022 (Alibaba) (WSDM) [RACP] Modeling Users’ Contextualized Page-wise Feedback for Click-Through Rate Prediction in E-commerce Search
-
2022 (JD) (WWW) Implicit User Awareness Modeling via Candidate Items for CTR Prediction in Search Ads
-
2022 (WWW) [FMLP] Filter-enhanced MLP is All You Need for Sequential Recommendation
-
2023 (JD) (CIKM) [IUI] IUI - Intent-Enhanced User Interest Modeling for Click-Through Rate Prediction
-
2023 (Meituan) (CIKM) [DCIN] Deep Context Interest Network for Click-Through Rate Prediction
-
2023 (Pinterest) (KDD) TransAct - Transformer-based Realtime User Action Model for Recommendation at Pinterest
-
2025 (Bytedance) ** (Arxiv) [LONGER] LONGER - Scaling Up Long Sequence Modeling in Industrial Recommenders
-
2025 (Kuaishou) (SIGIR) [FIM] FIM - Frequency-Aware Multi-View Interest Modeling for Local-Life Service Recommendation
-
2019 (Alibaba) (KDD) [MIMN] Practice on Long Sequential User Behavior Modeling for Click-Through Rate Prediction
-
2019 (Google) (WWW) Towards Neural Mixture Recommender for Long Range Dependent User Sequences
-
2020 (Alibaba) (Arxiv) ** [SIM] Search-based User Interest Modeling with Lifelong Sequential Behavior Data for Click-Through Rate Prediction
-
2020 (ICLR) Reformer - The Efficient Transformer
-
2020 (SIGIR) [UBR4CTR] User Behavior Retrieval for Click-Through Rate Prediction
-
2021 (Alibaba) (Arxiv) [ETA] End-to-End User Behavior Retrieval in Click-Through Rate Prediction Model
-
2022 (Alibaba) (Arxiv) ** [ETA] Efficient Long Sequential User Data Modeling for Click-Through Rate Prediction
-
2022 (Meituan) (CIKM) [SDIM] Sampling Is All You Need on Modeling Long-Term User Behaviors for CTR Prediction
-
2023 (Kuaishou) (Arixiv) [TWIN] TWIN - TWo-stage Interest Network for Lifelong User Behavior Modeling in CTR Prediction at Kuaishou
-
2023 (Kuaishou) (CIKM) [QIN] Query-dominant User Interest Network for Large-Scale Search Ranking
-
2024 (Kuaishou) (CIKM) [TWINv2] TWIN V2 - Scaling Ultra-Long User Behavior Sequence Modeling for Enhanced CTR Prediction at Kuaishou
-
2024 (Tencent) (KDD) [LCN] Cross-Domain LifeLong Sequential Modeling for Online Click-Through Rate Prediction
-
2025 (Kuaishou) (KDD) [HiT-LBM] Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model
-
2025 (Meta) (KDD) DV365 - Extremely Long User History Modeling at Instagram
-
2025 (Pinterest) (Arxiv) [TransActV2]TransAct V2 - Lifelong User Action Sequence Modeling on Pinterest Recommendation
-
2014 (Google) (NIPS) [Knoledge Distillation] Distilling the Knowledge in a Neural Network
-
2015 (ICLR) [Fitnets] Fitnets - Hints for thin deep nets
-
2018 (Alibaba) (AAAI) [Rocket] Rocket launching - A universal and efficient framework for training well-performing light net
-
2018 (KDD)[Ranking Distillation] Ranking distillation - Learning compact ranking models with high performance for recommender system
-
-
-
1998 (SIGIR) ** [MRR] The Use of MMR, Diversity-Based Reranking for Reordering Documents and Producing Summaries
-
2005 (WWW) Improving Recommendation Lists Through Topic Diversification
-
2008 (SIGIR) [α-NDCG] Novelty and Diversity in Information Retrieval Evaluation
-
2009 (Microsoft) (WSDM) Diversifying Search Results
-
2010 (WWW) Exploiting Query Reformulations for Web Search Result Diversification
-
2016 (Amazon) (RecSys) Adaptive, Personalized Diversity for Visual Discovery
-
2017 (Hulu) (NIPS) [DPP] Fast Greedy MAP Inference for Determinantal Point Process to Improve Recommendation Diversity
-
2018 (Alibaba) (IJCAI) [Alibaba GMV] Globally Optimized Mutual Influence Aware Ranking in E-Commerce Search
-
2018 (Google) (CIKM) [DPP] Practical Diversified Recommendations on YouTube with Determinantal Point Processes
-
2018 (SIGIR) [DLCM] Learning a Deep Listwise Context Model for Ranking Refinement
-
2019 (Alibaba) (WWW) [Value-based RL] Value-aware Recommendation based on Reinforcement Profit Maximization
-
2019 (Alibaba) (KDD) [GAttN] Exact-K Recommendation via Maximal Clique Optimization
-
2019 (Alibaba) (RecSys) ** [PRM] Personalized Re-ranking for Recommendation
-
2019 (Google) (Arxiv) Reinforcement Learning for Slate-based Recommender Systems - A Tractable Decomposition and Practical Methodology
-
2019 (Google) (Arxiv) Seq2slate - Re-ranking and slate optimization with rnns
-
2019 (Google) (IJCAI) [SlateQ] SLATEQ - A Tractable Decomposition for Reinforcement Learning with Recommendation Sets
-
2020 (Airbnb) (KDD) Managing Diversity in Airbnb Search
-
2020 (Alibaba) (CIKM) [EdgeRec] EdgeRec - Recommender System on Edge in Mobile Taobao
-
2020 (Huawei) (Arxiv) Personalized Re-ranking for Improving Diversity in Live Recommender Systems
-
2021 (Alibaba) (Arxiv) [PRS] Revisit Recommender System in the Permutation Prospective
-
2021 (Google) (WSDM) User Response Models to Improve a REINFORCE Recommender System
-
2021 (Microsoft) Diversity on the Go! Streaming Determinantal Point Processes under a Maximum Induced Cardinality Objective
-
2023 (Amazon) (KDD) RankFormer - Listwise Learning-to-Rank Using Listwide Labels
-
2023 (Meituan) (KDD) PIER - Permutation-Level Interest-Based End-to-End Re-ranking Framework in E-commerce
-
2024 (Kuaishou) (KDD) [NAR4Rec] Non-autoregressive Generative Models for Reranking Recommendation
-
2015 (Google) (Arxiv) Deep Reinforcement Learning in Large Discrete Action Spaces
-
2015 (Google) (Arxiv) Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
-
2017 (KDD) [DCM] Deep Choice Model Using Pointer Networks for Airline Itinerary Prediction
-
2018 (Microsoft) (EMNLP) [RL4NMT] A study of reinforcement learning for neural machine translation
-
2019 (Google) (Arxiv) Seq2slate - Re-ranking and slate optimization with rnns
-
2020 (ICLR) [StructBERT] StructBERT - Incorporating Language Structures into Pre-training for Deep Language Understanding
-
2021 (Alibaba) (WWW) [MASM] Learning a Product Relevance Model from Click-Through Data in E-Commerce
-
2023 (Meituan) (CIKM) [SPM] SPM - Structured Pretraining and Matching Architectures for Relevance Modeling in Meituan Search
-
2024 (Alibaba) (KDD) [DeepBoW] Deep Bag-of-Words Model - An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce
-
2024 (Walmart) (SIGIR) Large Language Models for Relevance Judgment in Product Search
-
2025 (Alibaba) (WWW) [ELLM] Explainable LLM-driven Multi-dimensional Distillation for E-Commerce Relevance Learning
-
2025 (Tencent) (KDD) [GenFR] Applying Large Language Model For Relevance Search In Tencent
-
2024 (Alibaba) (CIKM) [SimTier] Enhancing Taobao Display Advertising with Multimodal Representations - Challenges, Approaches and Insights
-
2024 (Kuaishou) (Arxiv) End-to-end training of Multimodal Model and ranking Model
-
2024 (Kuaishou) (Arxiv) [QARM] QARM - Quantitative Alignment Multi-Modal Recommendation at Kuaishou
-
2025 (Alibaba) (Arixv) MIM - Multi-modal Content Interest Modeling Paradigm for User Behavior Modeling
-
2025 (Baidu) (KDD) Large Vison-Language Foundation Model in Baidu AIGC Image Advertising
-
2025 (Baidu) (KDD) Multi-Branch Collaborative Learning Network for Video Quality Assessment in Industrial Video Search
-
2025 (Kuaishou) (Arxiv) [HCMRM] HCMRM -A High-Consistency Multimodal Relevance Model for Search Ads
-
2013 (Google) (NIPS) [Word2vec] Distributed Representations of Words and Phrases and their Compositionality
-
2014 (Google) (NIPS) [Seq2Seq] Sequence to Sequence Learning with Neural Networks
-
2017 (Google) (NIPS) [Transformer] Attention Is All You Need
-
2017 (OpenAI) (NIPS) [RLHF] Deep Reinforcement Learning from Human Preferences
-
2018 (OpenAI) (Arxiv) [GPT-1] Improving Language Understanding by Generative Pre-Training
-
2019 (Google) (NAACL) [Bert] BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding
-
2019 (OpenAI) (Arxiv) [GPT-2] Language Models are Unsupervised Multitask Learners
-
2020 (Arxiv) Scaling Laws for Neural Language Models
-
2020 (Meta) (NIPS) [RAG] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
-
2020 (OpenAI) (Arxiv) [GPT-3] Language Models are Few-Shot Learners
-
2021 (Microsoft) (Arxiv) [LoRA] LoRA - Low-Rank Adaptation of Large Language Models
-
2022 (Google) (Arxiv) [PaLM] PaLM - Scaling Language Modeling with Pathways
-
2022 (Google) (JMLR) [SwitchTransfomers] Switch Transformers - Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
-
2022 (Google) (NIPS) [COT] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
-
2022 (Google) (NIPS) [ChainOfThought] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
-
2022 (Google) (TMLR) [Emergent] Emergent Abilities of Large Language Models
-
2022 (OpenAI) (Arxiv) [InstructGPT] [RLHF] Training language models to follow instructions with human feedback
-
2022 (OpenAI) (Arxiv) [WebGPT] Learning to summarize from human feedback
-
2022 (OpenAI) (Arxiv) [WebGPT] WebGPT - Browser-assisted question-answering with human feedback
-
2023 (Alibaba) (Arxiv) [QWEN] QWEN Technical Report
-
2023 (Meta) (Arxiv) [LLaMA-2] Llama 2 - Open Foundation and Fine-Tuned ChatModels
-
2023 (Meta) (Arxiv) [LLaMA] LLaMA - Open and Efficient Foundation Language Models
-
2023 (OpenAI) (Arxiv) [GPT4] GPT-4 Technical Report
-
2025 (Alibaba) (Arxiv) [QWEN-2.5] QWEN 2.5 Technical Report
-
2025 (Alibaba) (Arxiv) [Qwen2.5-VL] Qwen2.5-VL Technical Report
-
2025 (Alibaba) (Arxiv) [Qwen3 Embedding] Qwen3 Embedding - Advancing Text Embedding and Reranking Through Foundation Models
-
2025 (Alibaba) (Arxiv) [Qwen3] Qwen3 Technical Report
-
2025 (Arxiv) A Survey of Large Language Models
-
2025 (Google) (Arxiv) [SigLIP2] SigLIP 2 - Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
-
2025 [DeepSeek-R1] DeepSeek-R1 -Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
-
2025 [DeepSeek-V3] DeepSeek-V3 Technical Report
-
-
-
-
-
-
-
-
-
resources
-
2020 (Alibaba) (AAAI) [DMR] Deep Match to Rank Model for Personalized Click-Through Rate Prediction
-
2020 (Alibaba) (CIKM) [BERT4Rec] BERT4Rec - Sequential Recommendation with Bidirectional Encoder Representations from Transformer
-
2020 (Alibaba) (KDD) Disentangled Self-Supervision in Sequential Recommenders
-
2020 (Arxiv) UserBERT - Self-supervised User Representation Learning
-
2020 (Arxiv) [SGL] Self-supervised Graph Learning for Recommendation
-
2020 (CIKM) [S3Rec] S3-Rec - Self-Supervised Learning for Sequential Recommendation with Mutual Information Maximization
-
2020 (EMNLP) [PTUM] PTUM - Pre-training User Model from Unlabeled User Behaviors via Self-supervision
-
2020 (SIGIR) Self-Supervised Reinforcement Learning for Recommender Systems
-
2021 (Alibaba) (Arxiv) [CLRec] Contrastive Learning for Debiased Candidate Generation in Large-Scale Recommender Systems
-
2021 (Alibaba) (CIKM) * [ZEUS] Self-Supervised Learning on Users’ Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce
-
2021 (Alibaba) (WWW) Contrastive Pre-training for Sequential Recommendation
-
2021 (Google) (CIKM) Self-supervised Learning for Large-scale Item Recommendations
-
2021 (WSDM) [Prop] PROP - Pre-training with Representative Words Prediction for Ad-hoc Retrieval
-
2010 (Yahoo) (WWW) [LinUCB] A Contextual-Bandit Approach to Personalized News Article Recommendation
-
2018 (Spotify) (Recsys) [Spotify Bandit] Explore, Exploit, and Explain Personalizing Explainable Recommendations with Bandits
-
2018 [Microsoft] (WWW) [DRN] DRN - A Deep Reinforcement Learning Framework for News Recommendation
-
2019 (Google) (IJCAI) *[SlateQ] SLATEQ - A Tractable Decomposition for Reinforcement Learning with Recommendation Sets
-
2019 (Google) (WSDM) *[Top-K Off-Policy] Top-K Off-Policy Correction for a REINFORCE Recommender System
-
2019 (Sigweb) Deep Reinforcement Learning for Search, Recommendation, and Online Advertising - A Survey
-
2020 (Bytedance) (KDD) [RAM] Jointly Learning to Recommend and Advertise
-
2020 (JD) (SIGIR) [NICF] Neural Interactive Collaborative Filtering
-
2023 (Airbnb) (KDD) Optimizing Airbnb Search Journey with Multi-task Learning
-
2023 (Alibaba) (KDD) Capturing Conversion Rate Fluctuation during Sales Promotions - A Novel Historical Data Reuse Approach
-
2023 (Amazon) (KDD) RankFormer - Listwise Learning-to-Rank Using Listwide Labels
-
2023 (Baidu) (KDD) Learning Discrete Document Representations in Web Search
-
2023 (Baidu) (KDD) S2phere - Semi-Supervised Pre-training for Web Search over Heterogeneous Learning to Rank Data
-
2023 (Google) (KDD) Improving Training Stability for Multitask Ranking Models in Recommender Systems
-
2023 (Kuaishou) (KDD) Tree based Progressive Regression Model for Watch-Time Prediction in Short-video Recommendation
-
2023 (Kuaishou) (KDD) [PEPNet] PEPNet - Parameter and Embedding Personalized Network for Infusing with Personalized Prior Information
-
2023 (Meituan) (KDD) PIER - Permutation-Level Interest-Based End-to-End Re-ranking Framework in E-commerce
-
2023 (Meta) (KDD) AdaTT - Adaptive Task-to-Task Fusion Network for Multitask Learning in Recommendations
-
2023 (Microsoft) (KDD) Unifier - A Unified Retriever for Large-Scale Retrieval
-
2023 (Pinterest) (KDD) TransAct - Transformer-based Realtime User Action Model for Recommendation at Pinterest
-
2023 (Tencent) (KDD) Binary Embedding-based Retrieval at Tencent
-
2023 (Tencent) (KDD) CT4Rec - Simple yet Effective Consistency Training for Sequential Recommendation
-
2023 (Tencent) (KDD) Scenario-Adaptive Feature Interaction for Click-Through Rate Prediction
-
2024 (Airbnb) (KDD) Multi-objective Learning to Rank by Model Distillation
-
2024 (Bytedance) (KDD) [Trinity] Trinity - Syncretizing Multi-:Long-Tail:Long-Term Interests All in One
-
2024 (Kuaishou) (KDD) [GradCraft] GradCraft - Elevating Multi-task Recommendations through Holistic Gradient Crafting
-
2024 (Kuaishou) (KDD) [NAR4Rec] Non-autoregressive Generative Models for Reranking Recommendation
-
2024 (Shopee) (KDD) [ResFlow] Residual Multi-Task Learner for Applied Ranking
-
2024 (Tencent) (KDD) Understanding the Ranking Loss for Recommendation with Sparse User Feedback
-
2024 (Tencent) (KDD) [BBP] Beyond Binary Preference - Leveraging Bayesian Approaches for Joint Optimization of Ranking and Calibration
-
2024 (Tencent) (KDD) [LCN] Cross-Domain LifeLong Sequential Modeling for Online Click-Through Rate Prediction
-
2024 (Tencent) (KDD) [STEM] Ads Recommendation in a Collapsed and Entangled World
-
2014 (Google) (NIPS) [Knoledge Distillation] Distilling the Knowledge in a Neural Network
-
2015 (Google) (Arxiv) Deep Reinforcement Learning in Large Discrete Action Spaces
-
2015 (Google) (Arxiv) Deep Reinforcement Learning with Attention for Slate Markov Decision Processes with High-Dimensional States and Actions
-
2016 (Google) (DLRS) **[Wide & Deep] Wide & Deep Learning for Recommender Systems
-
2016 (Google) (RecSys) **[Youtube DNN] Deep Neural Networks for YouTube Recommendations
-
2017 (Google) (ICLR) [Sparsely-Gated MOE] Outrageously large neural networks - The sparsely-gated mixture-of-experts layer
-
2018 (Google) (CIKM) [DPP] Practical Diversified Recommendations on YouTube with Determinantal Point Processes
-
2018 (Google) (KDD) [MMoE] Modeling task relationships in multi-task learning with multi-gate mixture-of-experts
-
2019 (Google) (Arxiv) Seq2slate - Re-ranking and slate optimization with rnns
-
2019 (Google) (IJCAI) *[SlateQ] SLATEQ - A Tractable Decomposition for Reinforcement Learning with Recommendation Sets
-
2019 (Google) (IJCAI) [SlateQ] SLATEQ - A Tractable Decomposition for Reinforcement Learning with Recommendation Sets
-
2019 (Google) (Recsys)[Youtube Multi-task] Recommending what video to watch next - a multitask ranking system
-
2019 (Google) (WSDM) *[Top-K Off-Policy] Top-K Off-Policy Correction for a REINFORCE Recommender System
-
2020 (Google) (Arxiv) Self-supervised Learning for Large-scale Item Recommendations
-
2020 (Google) (KDD) [Google Drive] Improving Recommendation Quality in Google Drive
-
2020 (Google) (KDD) [MoSE] Multitask Mixture of Sequential Experts for User Activity Streams
-
2020 (JD) (CIKM) *[DMT] Deep Multifaceted Transformers for Multi-objective Ranking in Large-Scale E-commerce Recommender Systems
-
2020 (JD) (CIKM) *[DecGCN] Decoupled Graph Convolution Network for Inferring Substitutable and Complementary Items
-
2020 (JD) (SIGIR) [NICF] Neural Interactive Collaborative Filtering
-
2020 (JD) (WSDM) [HUP] Hierarchical User Profiling for E-commerce Recommender Systems
-
2018 (Alibaba) (IJCAI) [Alibaba GMV] Globally Optimized Mutual Influence Aware Ranking in E-Commerce Search
-
2018 (Alibaba) (IJCAI) [JUMP] JUMP - A Joint Predictor for User Click and Dwell Time
-
2018 (Alibaba) (KDD) [DUPN] Perceive Your Users in Depth - Learning Universal User Representations from Multiple E-commerce Tasks
-
2020 (Alibaba) (CIKM) [TIEN] Deep Time-Aware Item Evolution Network for Click-Through Rate Prediction
-
2020 (Alibaba) (NIPS) Neuron-level Structured Pruning using Polarization Regularizer
-
2021 (Alibaba) (AAAI) [ANPP] Attentive Neural Point Processes for Event Forecasting
-
2021 (Alibaba) (AAAI) [ES-DFM] Capturing Delayed Feedback in Conversion Rate Prediction via Elapsed-Time Sampling
-
2021 (Alibaba) (CIKM) [ZEUS] Self-Supervised Learning on Users’ Spontaneous Behaviors for Multi-Scenario Ranking in E-commerce
-
2021 (Alibaba) (KDD) [MGDSPR] Embedding-based Product Retrieval in Taobao Search
-
2022 (Alibaba) (CIKM) [CLE-QR] Query Rewriting in TaoBao Search
-
2022 (Alibaba) (CIKM) [MOPPR] Multi-Objective Personalized Product Retrieval in Taobao Search
-
2023 (Alibaba) (KDD) [ASMOL] Rethinking the Role of Pre-ranking in Large-scale E-Commerce Searching System
For Tasks:
Click tags to check more tools for each tasksFor Jobs:
Alternative AI tools for Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
Similar Open Source Tools

Awesome-Deep-Learning-Papers-for-Search-Recommendation-Advertising
This repository contains a curated list of deep learning papers focused on industrial applications such as search, recommendation, and advertising. The papers cover various topics including embedding, matching, ranking, large models, transfer learning, and reinforcement learning.

Awesome-LLM-Ensemble
Awesome-LLM-Ensemble is a collection of papers on LLM Ensemble, focusing on the comprehensive use of multiple large language models to benefit from their individual strengths. It provides a systematic review of recent developments in LLM Ensemble, including taxonomy, methods for ensemble before, during, and after inference, benchmarks, applications, and related surveys.

Awesome-LM-SSP
The Awesome-LM-SSP repository is a collection of resources related to the trustworthiness of large models (LMs) across multiple dimensions, with a special focus on multi-modal LMs. It includes papers, surveys, toolkits, competitions, and leaderboards. The resources are categorized into three main dimensions: safety, security, and privacy. Within each dimension, there are several subcategories. For example, the safety dimension includes subcategories such as jailbreak, alignment, deepfake, ethics, fairness, hallucination, prompt injection, and toxicity. The security dimension includes subcategories such as adversarial examples, poisoning, and system security. The privacy dimension includes subcategories such as contamination, copyright, data reconstruction, membership inference attacks, model extraction, privacy-preserving computation, and unlearning.

Awesome-Graph-LLM
Awesome-Graph-LLM is a curated collection of research papers exploring the intersection of graph-based techniques with Large Language Models (LLMs). The repository aims to bridge the gap between LLMs and graph structures prevalent in real-world applications by providing a comprehensive list of papers covering various aspects of graph reasoning, node classification, graph classification/regression, knowledge graphs, multimodal models, applications, and tools. It serves as a valuable resource for researchers and practitioners interested in leveraging LLMs for graph-related tasks.

chatgpt.js
chatgpt.js is a powerful JavaScript library that allows for super easy interaction w/ the ChatGPT DOM. * Feature-rich * Object-oriented * Easy-to-use * Lightweight (yet optimally performant)

chatgpt-auto-refresh
ChatGPT Auto Refresh is a userscript that keeps ChatGPT sessions fresh by eliminating network errors and Cloudflare checks. It removes the 10-minute time limit from conversations when Chat History is disabled, ensuring a seamless experience. The tool is safe, lightweight, and a time-saver, allowing users to keep their sessions alive without constant copy/paste/refresh actions. It works even in background tabs, providing convenience and efficiency for users interacting with ChatGPT. The tool relies on the chatgpt.js library and is compatible with various browsers using Tampermonkey, making it accessible to a wide range of users.

Awesome-LLM-Post-training
The Awesome-LLM-Post-training repository is a curated collection of influential papers, code implementations, benchmarks, and resources related to Large Language Models (LLMs) Post-Training Methodologies. It covers various aspects of LLMs, including reasoning, decision-making, reinforcement learning, reward learning, policy optimization, explainability, multimodal agents, benchmarks, tutorials, libraries, and implementations. The repository aims to provide a comprehensive overview and resources for researchers and practitioners interested in advancing LLM technologies.

awesome-cuda-tensorrt-fpga
Okay, here is a JSON object with the requested information about the awesome-cuda-tensorrt-fpga repository:

LLM-for-misinformation-research
LLM-for-misinformation-research is a curated paper list of misinformation research using large language models (LLMs). The repository covers methods for detection and verification, tools for fact-checking complex claims, decision-making and explanation, claim matching, post-hoc explanation generation, and other tasks related to combating misinformation. It includes papers on fake news detection, rumor detection, fact verification, and more, showcasing the application of LLMs in various aspects of misinformation research.

chatgpt-widescreen
ChatGPT Widescreen Mode is a browser extension that adds widescreen and fullscreen modes to ChatGPT, enhancing chat sessions by reducing scrolling and creating a more immersive viewing experience. Users can experience clearer programming code display, view multi-step instructions or long recipes on a single page, enjoy original content in a visually pleasing format, customize features like a larger chatbox and hidden header/footer, and use the tool with chat.openai.com and poe.com. The extension is compatible with various browsers and relies on code from the chatgpt.js library under the MIT license.

Awesome-LLMs-in-Graph-tasks
This repository is a collection of papers on leveraging Large Language Models (LLMs) in Graph Tasks. It provides a comprehensive overview of how LLMs can enhance graph-related tasks by combining them with traditional Graph Neural Networks (GNNs). The integration of LLMs with GNNs allows for capturing both structural and contextual aspects of nodes in graph data, leading to more powerful graph learning. The repository includes summaries of various models that leverage LLMs to assist in graph-related tasks, along with links to papers and code repositories for further exploration.

eleeye
ElephantEye is a free Chinese Chess program that follows the GNU Lesser General Public Licence. It is designed for chess enthusiasts and programmers to use freely. The program works as a XiangQi engine for XQWizard with strong AI capabilities. ElephantEye supports UCCI 3.0 protocol and offers various parameter settings for users to customize their experience. The program uses brute-force chess algorithms and static position evaluation techniques to search for optimal moves. ElephantEye has participated in computer chess competitions and has been tested on various online chess platforms. The source code of ElephantEye is available on SourceForge for developers to explore and improve.
For similar tasks

design-studio
Tiledesk Design Studio is an open-source, no-code development platform for creating chatbots and conversational apps. It offers a user-friendly, drag-and-drop interface with pre-ready actions and integrations. The platform combines the power of LLM/GPT AI with a flexible 'graph' approach for creating conversations and automations with ease. Users can automate customer conversations, prototype conversations, integrate ChatGPT, enhance user experience with multimedia, provide personalized product recommendations, set conditions, use random replies, connect to other tools like HubSpot CRM, integrate with WhatsApp, send emails, and seamlessly enhance existing setups.

koordinator
Koordinator is a QoS based scheduling system for hybrid orchestration workloads on Kubernetes. It aims to improve runtime efficiency and reliability of latency sensitive workloads and batch jobs, simplify resource-related configuration tuning, and increase pod deployment density. It enhances Kubernetes user experience by optimizing resource utilization, improving performance, providing flexible scheduling policies, and easy integration into existing clusters.

Streamer-Sales
Streamer-Sales is a large model for live streamers that can explain products based on their characteristics and inspire users to make purchases. It is designed to enhance sales efficiency and user experience, whether for online live sales or offline store promotions. The model can deeply understand product features and create tailored explanations in vivid and precise language, sparking user's desire to purchase. It aims to revolutionize the shopping experience by providing detailed and unique product descriptions to engage users effectively.

ChatGPT
ChatGPT is a desktop application available on Mac, Windows, and Linux that provides a powerful AI wrapper experience. It allows users to interact with AI models for various tasks such as generating text, answering questions, and engaging in conversations. The application is designed to be user-friendly and accessible to both beginners and advanced users. ChatGPT aims to enhance the user experience by offering a seamless interface for leveraging AI capabilities in everyday scenarios.

chatflow
Chatflow is a tool that provides a chat interface for users to interact with systems using natural language. The engine understands user intent and executes commands for tasks, allowing easy navigation of complex websites/products. This approach enhances user experience, reduces training costs, and boosts productivity.

zodiac
Zodiac is a frontend tool designed to interact with Google's Gemini Pro using vanilla JS. It provides a user-friendly interface for accessing the functionalities of Google AI Suite. The tool simplifies the process of utilizing Gemini Pro's capabilities through a straightforward web application. Users can easily integrate their API key and access various features offered by Google AI Suite. Zodiac aims to streamline the interaction with Gemini Pro and enhance the user experience by offering a simple and intuitive interface for managing AI tasks.

deforum-comfy-nodes
Deforum for ComfyUI is an integration tool designed to enhance the user experience of using ComfyUI. It provides custom nodes that can be added to ComfyUI to improve functionality and workflow. Users can easily install Deforum for ComfyUI by cloning the repository and following the provided instructions. The tool is compatible with Python v3.10 and is recommended to be used within a virtual environment. Contributions to the tool are welcome, and users can join the Discord community for support and discussions.

ChatGPT-Plugins
ChatGPT-Plugins is a repository containing plugins for ChatGPT-Next-Web. These plugins provide additional functionalities and features to enhance the ChatGPT experience. Users can easily deploy these plugins using Vercel or HuggingFace. The repository includes README files in English and Vietnamese for user guidance.
For similar jobs

weave
Weave is a toolkit for developing Generative AI applications, built by Weights & Biases. With Weave, you can log and debug language model inputs, outputs, and traces; build rigorous, apples-to-apples evaluations for language model use cases; and organize all the information generated across the LLM workflow, from experimentation to evaluations to production. Weave aims to bring rigor, best-practices, and composability to the inherently experimental process of developing Generative AI software, without introducing cognitive overhead.

LLMStack
LLMStack is a no-code platform for building generative AI agents, workflows, and chatbots. It allows users to connect their own data, internal tools, and GPT-powered models without any coding experience. LLMStack can be deployed to the cloud or on-premise and can be accessed via HTTP API or triggered from Slack or Discord.

VisionCraft
The VisionCraft API is a free API for using over 100 different AI models. From images to sound.

kaito
Kaito is an operator that automates the AI/ML inference model deployment in a Kubernetes cluster. It manages large model files using container images, avoids tuning deployment parameters to fit GPU hardware by providing preset configurations, auto-provisions GPU nodes based on model requirements, and hosts large model images in the public Microsoft Container Registry (MCR) if the license allows. Using Kaito, the workflow of onboarding large AI inference models in Kubernetes is largely simplified.

PyRIT
PyRIT is an open access automation framework designed to empower security professionals and ML engineers to red team foundation models and their applications. It automates AI Red Teaming tasks to allow operators to focus on more complicated and time-consuming tasks and can also identify security harms such as misuse (e.g., malware generation, jailbreaking), and privacy harms (e.g., identity theft). The goal is to allow researchers to have a baseline of how well their model and entire inference pipeline is doing against different harm categories and to be able to compare that baseline to future iterations of their model. This allows them to have empirical data on how well their model is doing today, and detect any degradation of performance based on future improvements.

tabby
Tabby is a self-hosted AI coding assistant, offering an open-source and on-premises alternative to GitHub Copilot. It boasts several key features: * Self-contained, with no need for a DBMS or cloud service. * OpenAPI interface, easy to integrate with existing infrastructure (e.g Cloud IDE). * Supports consumer-grade GPUs.

spear
SPEAR (Simulator for Photorealistic Embodied AI Research) is a powerful tool for training embodied agents. It features 300 unique virtual indoor environments with 2,566 unique rooms and 17,234 unique objects that can be manipulated individually. Each environment is designed by a professional artist and features detailed geometry, photorealistic materials, and a unique floor plan and object layout. SPEAR is implemented as Unreal Engine assets and provides an OpenAI Gym interface for interacting with the environments via Python.

Magick
Magick is a groundbreaking visual AIDE (Artificial Intelligence Development Environment) for no-code data pipelines and multimodal agents. Magick can connect to other services and comes with nodes and templates well-suited for intelligent agents, chatbots, complex reasoning systems and realistic characters.