Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Find similar paper (article) #1

Open
fmauri90 opened this issue Sep 17, 2019 · 0 comments
Open

Find similar paper (article) #1

fmauri90 opened this issue Sep 17, 2019 · 0 comments

Comments

@fmauri90
Copy link
Collaborator

fmauri90 commented Sep 17, 2019

https://app.zenhub.com/files/206553149/c1c8834a-0664-4148-b31b-723a09295473/download

The goal of this paper is therefore to explore how language models can be
used to compare research paper abstracts, how they can best make use of the
other document features, and whether they are a more reasonable choice than
a vector space model based approach for this task. In particular, the authors combine
two ideas to address these questions. On the one hand, the authors consider the idea of
estimating language models for document features such as keywords, authors,
and journal, and derive a language model for the article by interpolating them. On the other
hand, the authors apply LDA (Latent Dirichlet Allocation) to discover latent topics in the documents, and explore how the keywords can help to improve the performance of standard LDA.

@fmauri90 fmauri90 changed the title Find similar paper Find similar paper (article) Sep 17, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant