baselines

FreqCacheEmbedding

This repo contains the implementation of FreqCacheEmbedding, which extends the vanilla PyTorch EmbeddingBag with cache mechanism to enable heterogeneous training for large scale recommendation models.

Dataset

Basically, the preprocessing processes are derived from Torchrec's utilities and Avazu kaggle community Please refer to recsys/datasets/preprocess_scripts dir to see the details.

During the time this repo was built, another commonly adopted dataset, Criteo 1TB is unavailable (see this issue). We will append its preprocessing & running scripts very soon.

Command

All the commands to run the FreqCacheEmbedding enabled recommendations models are presented in run.sh

Model

Currently, this repo only contains DLRM & DeepFM models, and we are working on testing more recommendation models.

Name		Name	Last commit message	Last commit date
parent directory ..
data		data
models		models
README.md		README.md
__init__.py		__init__.py
dlrm_main.py		dlrm_main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

baselines

baselines

README.md

FreqCacheEmbedding

Dataset

Command

Model

Files

baselines

Directory actions

More options

Directory actions

More options

Latest commit

History

baselines

Folders and files

parent directory

README.md

FreqCacheEmbedding

Dataset

Command

Model