Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

作者您好!想请教一下Yelp数据集中loadMat.py报错的问题 #16

Open
fujiaxu0207 opened this issue Jun 12, 2024 · 4 comments

Comments

@fujiaxu0207
Copy link

2024-06-12 20:52:24.284970
Traceback (most recent call last):
File "loadMat.py", line 165, in
ratingMat,trustMat,categoryMat=filterData(ratingMat,trustMat,categoryMat)
File "loadMat.py", line 74, in filterData
trustMat=trustMat[idx][:,idx]
File "/root/anaconda3/envs/xytf/fjx_HGCL/lib/python3.7/site-packages/scipy/sparse/_index.py", line 33, in getitem
row, col = self._validate_indices(key)
File "/root/anaconda3/envs/xytf/fjx_HGCL/lib/python3.7/site-packages/scipy/sparse/_index.py", line 138, in _validate_indices
row = self._asindices(row, M)
File "/root/anaconda3/envs/xytf/fjx_HGCL/lib/python3.7/site-packages/scipy/sparse/_index.py", line 170, in _asindices
raise IndexError('index (%d) out of range' % max_indx)
IndexError: index (1968701) out of range

请问为什么越界了呢?使用的是dataraw里面的数据

@Mrzjk
Copy link

Mrzjk commented Aug 31, 2024

我看哪个loadMat文件中是为了节省内存,我把那一行注释也可以运行

@fujiaxu0207
Copy link
Author

我看哪个loadMat文件中是为了节省内存,我把那一行注释也可以运行

应该是的

@cmr123456
Copy link
Collaborator

非常感谢您们的问题,之前yelp数据集上传的原始数据rawdata里的trust.csv是已经处理过的,由于误操作导致。然后因为内存大小的限制,我把yelp数据集的原始数据集rawdata已经放在/dataset目录下的README.md里展示的Google Drive的地址。有什么问题,欢迎继续提问。

@FZYTina
Copy link

FZYTina commented Dec 1, 2024

您好,[Google Drive]的地址需要访问权限,无法下载Yelp原始数据集,方便的话请提供新的地址,感谢!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants