File layout after crawling Weibo #20

Open
Kedreamix opened this issue Mar 13, 2023 · 2 comments

Comments

@Kedreamix

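Hello, I have a question. After crawling all the comment files, did you process them further? I noticed steps such as "excel分月" (Excel split by month) and "正文分月" (post text split by month). How are these folders structured, and could you briefly describe what was done to produce them? I could not find this in the crawling part.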

@Kedreamix
Author

One more question: in the crawling code, why is there already an xlsx file of the post text (正文) at the very start? Where does it come from?

@stay-leave
Owner

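The post text was crawled with a different crawler; a link is provided. Splitting by month just slices the data by time: do it in Excel first, then convert the result to TXT.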
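
For readers who would rather script the monthly split than do it by hand in Excel, here is a minimal sketch of the same idea in Python. It is not the code used in this repository. The column names `created_at` and `text`, the timestamp format, and the `YYYY-MM.txt` file naming are all assumptions made for illustration. Rows are passed in as plain dictionaries, so the xlsx would need to be loaded separately first (for example with pandas `read_excel`).

```python
from collections import defaultdict
from datetime import datetime
from pathlib import Path


def split_by_month(rows, time_key="created_at", text_key="text",
                   time_format="%Y-%m-%d %H:%M"):
    """Group post texts under a 'YYYY-MM' key, keeping input order.

    `time_key`, `text_key` and `time_format` are hypothetical defaults;
    adjust them to match the columns of the actual spreadsheet.
    """
    buckets = defaultdict(list)
    for row in rows:
        posted = datetime.strptime(row[time_key], time_format)
        buckets[posted.strftime("%Y-%m")].append(row[text_key])
    return dict(buckets)


def write_month_files(buckets, out_dir):
    """Write one UTF-8 TXT file per month, one post per line."""
    out_path = Path(out_dir)
    out_path.mkdir(parents=True, exist_ok=True)
    written = []
    for month, texts in sorted(buckets.items()):
        target = out_path / f"{month}.txt"  # assumed naming scheme
        target.write_text("\n".join(texts) + "\n", encoding="utf-8")
        written.append(target)
    return written
```

Calling `write_month_files(split_by_month(rows), "by_month")` would then produce one TXT file per month in a folder named `by_month`, which is also a hypothetical name.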
