Replies: 1 comment 2 replies
-
@mvOvOmg Hello, Mina. Thanks for your attention to RecBole! For the first question, you don't need to manually divide the originial dataset into three parts, just prepare the atomic files required by RecBole. In other words, your atomic file contains the entire dataset, and RecBole will divide the dataset into training, validation and testing parts. You can also change the data splitting strategy by config settings. Knowledge-based recommendation models utilize KG information to make recommendations, so it needs to specify and load the kg information of the dataset. If you want to use KG models on ml-1m datasets, you need to put USER_ID_FIELD: user_id
ITEM_ID_FIELD: item_id
HEAD_ENTITY_ID_FIELD: head_id
TAIL_ENTITY_ID_FIELD: tail_id
RELATION_ID_FIELD: relation_id
ENTITY_ID_FIELD: entity_id
load_col:
inter: [user_id, item_id]
kg: [head_id, relation_id, tail_id]
link: [item_id, entity_id] For the second question, you should generate the KG atomic files (*.kg and *.link) for MovieLens dataset based on ML-KG that you have downloaded. In order to facilitate the users using RecBole, we have developed a repository RecSysDatasets of public data sources for Recommender Systems (RS), including knowledge graph generation. You can refer to MovieLens-KG.md for more details about the conversion process. |
Beta Was this translation helpful? Give feedback.
-
Hello, developers,
I have to use originial dataset and have already split into training, valid and test set.
Without KG part, I have to rename them as A.train.inter, A.valid.inter and A.test.inter, and set 'benchmark_filename' as ['train','val','test].
I am wondering how to construct customize dataset for Knowledge Graph-based models?
Sincerely,
Mina
Beta Was this translation helpful? Give feedback.
All reactions