Experiment code on Advertising Dataset of paper Deep Session Interest Network for Click-Through Rate Prediction(https://arxiv.org/abs/1905.06482)
Yufei Feng , Fuyu Lv, Weichen Shen and Menghan Wang and Fei Sun and Yu Zhu and Keping Yang.
In Proceedings of 28th International Joint Conference on Artificial Intelligence (IJCAI 2019)
please use
pip install -r requirements.txt
to setup the operating environment in python3.6
.
- Download Dataset Ad Display/Click Data on Taobao.com
- Extract the files into the
raw_data
directory
- run
0_gen_sampled_data.py
, sample the data by user - run
1_gen_sessions.py
, generate historical session sequence for each user
- run
2_gen_din_input.py
,generate input data - run
train_din.py
- run
2_gen_dien_input.py
,generate input data(It may take a long time to sample negative samples.) - run
train_dien.py
- run
2_gen_dsin_input.py
,generate input data - run
train_dsin.py
The loss of DSIN with
bias_encoding=True
may be NaN sometimes on Advertising Dataset and it remains a confusing problem since it never occurs in the production environment.We will work on it and also appreciate your help.
This project is licensed under the terms of the Apache-2 license. See LICENSE for additional details.