How to load Steam dataset in python code like the ml-100k dataset? #792
-
Hi, But was wondering if I could change the dataset name "ml-100k" to the Steam dataset you have on RecBole. What should I write instead of "ml-100k" to get the steam dataset? Does it have a specific name? My goal is to test various algorithms on this dataset but I can't proceed because I can't load the Steam dataset. ps. I'm relatively new to Python. Sincerely, |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 19 replies
-
@alanjacob10 Hi, you need to download the steam dataset, and then change the config. About the dataset, you can get it from Our Google Drive. Here I recommand you to create a folder call
About the config setting, we support three way to set config(by command line, config files and parameter dicts). I will take the parameter dicts as an example: from recbole.quick_start import run_recbole
parameter_dict = {
'data_path': "The file path of the Dataset"
'model': BPR
'dataset': steam
'load_col’:
inter: [ ]
......
}
run_recbole(config_dict=parameter_dict) For more infomation about our config settings, you can read our docs. |
Beta Was this translation helpful? Give feedback.
-
Hi again, I don't know if i need to import something else from Pycharm other than recbole or what the problem is. Sincerely, |
Beta Was this translation helpful? Give feedback.
-
Let me make a summary to this question.
data_path: The path of "MyDataset" (mentioned before)
load_col: Decide which file and column you want to load;
USER_ID_FIELD: Field name of user ID feature
ITEM_ID_FIELD: Field name of item ID feature
RATING_FIELD: Field name of rating feature
TIME_FIELD: Field name of timestamp feature
|
Beta Was this translation helpful? Give feedback.
-
Hi Guys, |
Beta Was this translation helpful? Give feedback.
Let me make a summary to this question.
If you want to load a new dataset to run models, you can follow these steps:
In RecBole, we have a default dataset: ml-100k. If you want to use other dataset, you need to prepare your data and convert the raw data into
Atomic Files
(About Atomic Files, here is the docs). By the way, we have prepared some popular datasets and you can download the atomic files of these datasets from our Google Drive or Baidu Wangpan. Then, create a folder calledMyDataset
and organize the file structure like: