NLP-89680

A repository for a university NLP course.

Language: Python

Assignment 1

Assignment 2

PCFG (Probabilistic Context-Free Grammar)
Writing a grammar that generates grammatical English sentences, and exploring how some (but not all) aspects of the English language can be implemented in a grammar.
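
As an illustration of how a PCFG generates sentences, here is a minimal sketch of top-down sampling. The grammar, its probabilities, and the names `GRAMMAR` and `generate` are hypothetical and are not taken from the assignment. The split between transitive (`Vt`) and intransitive (`Vi`) verbs shows one way a grammar can encode an aspect of English.

```python
import random

# Hypothetical toy grammar: each nonterminal maps to (right-hand side, probability)
# pairs, and the probabilities for each nonterminal sum to 1.
GRAMMAR = {
    "S":   [(("NP", "VP"), 1.0)],
    "NP":  [(("Det", "N"), 0.7), (("Det", "Adj", "N"), 0.3)],
    "VP":  [(("Vt", "NP"), 0.6), (("Vi",), 0.4)],
    "Det": [(("the",), 0.6), (("a",), 0.4)],
    "Adj": [(("small",), 0.5), (("happy",), 0.5)],
    "N":   [(("dog",), 0.5), (("linguist",), 0.5)],
    "Vt":  [(("sees",), 0.5), (("likes",), 0.5)],
    "Vi":  [(("sleeps",), 0.5), (("laughs",), 0.5)],
}

def generate(symbol="S", rng=random):
    """Expand `symbol` top-down, sampling one rule for each nonterminal."""
    if symbol not in GRAMMAR:
        return [symbol]  # terminal: emit the word itself
    rules = GRAMMAR[symbol]
    rhs = rng.choices([r for r, _ in rules], weights=[p for _, p in rules])[0]
    words = []
    for child in rhs:
        words.extend(generate(child, rng))
    return words

if __name__ == "__main__":
    print(" ".join(generate()))
```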

Assignment 3

Distributional Semantics
Finding words with similar meanings using several combinations of algorithms, with a detailed report comparing them.
Data: Wikipedia
Algorithms:

word contexts: (a) sentence (b) window (c) dependency tree (parent/child, direction of arc, skipping over prepositions)
similarity: (a) cosine distance (b) PMI
order: (a) 1st order similarity (b) 2nd order similarity
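
To make the options above concrete, here is a minimal sketch of one combination: window contexts, PMI weighting, and cosine similarity between the resulting vectors. It rests on several assumptions: PMI is read here as a weighting of word-context counts, negative values are clipped to zero (positive PMI), and the function names and toy sentences are hypothetical. Comparing two words through their context vectors, as `cosine` does, corresponds to one common reading of second-order similarity. Scoring a word pair directly by its co-occurrence would be the first-order counterpart.

```python
import math
from collections import Counter, defaultdict

def window_counts(sentences, window=2):
    """Count (word, context word) pairs within +/- `window` tokens."""
    counts = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo, hi = max(0, i - window), min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[word][tokens[j]] += 1
    return counts

def ppmi_vectors(counts):
    """Reweight counts as positive PMI: max(0, log(p(w, c) / (p(w) * p(c))))."""
    total = sum(sum(ctx.values()) for ctx in counts.values())
    word_total = {w: sum(ctx.values()) for w, ctx in counts.items()}
    ctx_total = Counter()
    for ctx in counts.values():
        ctx_total.update(ctx)
    vectors = {}
    for w, ctx in counts.items():
        vec = {}
        for c, n in ctx.items():
            pmi = math.log(n * total / (word_total[w] * ctx_total[c]))
            if pmi > 0:
                vec[c] = pmi
        vectors[w] = vec
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(x * v[k] for k, x in u.items() if k in v)
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

if __name__ == "__main__":
    # Toy corpus standing in for the Wikipedia data.
    corpus = [["the", "cat", "drinks", "milk"], ["the", "dog", "drinks", "water"]]
    vecs = ppmi_vectors(window_counts(corpus))
    print(cosine(vecs["cat"], vecs["dog"]), cosine(vecs["cat"], vecs["milk"]))
```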

Assignment 4

Relation Extraction
Given a small amount of data (news articles), extract the Named Entities in each sentence and the relations between them.
For example: Yosi (work for) CBS

Algorithm:

For each sentence, extract Named Entities, the dependency tree and POS tags using the spaCy library.
Generate a sequence from the path between the two entities in the dependency tree.
Run an LSTM over the path and concatenate its output with other feature vectors, such as the Named Entity type and the Named Entity POS tag.
Pass the result through an MLP with softmax activation.
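
The path-extraction step can be sketched without any parser dependency. The sketch below assumes the parse is given as a list of head indices in which the root points to itself (the same convention spaCy uses for `token.head`). The function names, the toy sentence, and its hand-written parse are hypothetical. The resulting token sequence is what would be fed to the LSTM.

```python
def path_to_root(heads, i):
    """Token indices from token i up to the root (the root is its own head)."""
    path = [i]
    while heads[i] != i:
        i = heads[i]
        path.append(i)
    return path

def dependency_path(heads, a, b):
    """Token indices on the tree path from a to b, via their lowest common ancestor."""
    up_a, up_b = path_to_root(heads, a), path_to_root(heads, b)
    seen = set(up_b)
    lca = next(i for i in up_a if i in seen)
    left = up_a[:up_a.index(lca) + 1]      # a, ..., lca
    right = up_b[:up_b.index(lca)][::-1]   # child of lca, ..., b
    return left + right

if __name__ == "__main__":
    # Hand-written parse of a toy sentence: "works" is the root,
    # "Yosi" and "for" attach to "works", and "CBS" attaches to "for".
    tokens = ["Yosi", "works", "for", "CBS"]
    heads = [1, 1, 1, 2]
    print([tokens[i] for i in dependency_path(heads, 0, 3)])
```

With a real spaCy parse, the head list could be built as `[token.head.i for token in doc]`.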

Challenges:

small dataset
missing labels (entities/relations that should have been included in the gold file)
mismatches between the Named Entities in the gold file and those in the spaCy output
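
One possible way to handle the entity mismatches is to align each gold entity with the predicted entity that overlaps it most. This is only a sketch under assumptions, not necessarily what the assignment code does: entities are represented as hypothetical `(start, end)` character spans with an exclusive end, and the function names are made up for illustration.

```python
def overlap(a, b):
    """Number of characters shared by two (start, end) spans, end exclusive."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))

def align_entities(gold_spans, predicted_spans):
    """Map each gold span to the predicted span with the largest overlap, or None."""
    alignment = {}
    for gold in gold_spans:
        best = max(predicted_spans, key=lambda p: overlap(gold, p), default=None)
        if best is not None and overlap(gold, best) > 0:
            alignment[gold] = best
        else:
            alignment[gold] = None  # no predicted entity covers this gold entity
    return alignment

if __name__ == "__main__":
    gold = [(0, 4), (15, 18), (40, 45)]
    predicted = [(0, 4), (14, 18), (30, 35)]
    print(align_entities(gold, predicted))
```

Gold entities that map to `None` could then be counted separately during error analysis.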

Architecture choice:
A pure ML approach was chosen rather than a hybrid of ML and rules. A hybrid could be built after error analysis; for example, the model sometimes confuses the relation (work for) with (kill), because both relations involve PERSON entities.
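
As a sketch of what such a hybrid might look like, a rule layer could filter the model's scores by the entity types of the two arguments. The constraint table below is an assumption made for illustration (that "work for" links a PERSON to an ORG and "kill" links two PERSONs), and the names `ALLOWED_TYPES` and `pick_relation` are hypothetical.

```python
# Hypothetical constraints: allowed (subject type, object type) pairs per relation.
ALLOWED_TYPES = {
    "work for": {("PERSON", "ORG")},
    "kill": {("PERSON", "PERSON")},
}

def pick_relation(scores, subj_type, obj_type):
    """Return the best-scoring relation whose type constraint holds, else None."""
    valid = {
        rel: score
        for rel, score in scores.items()
        if (subj_type, obj_type) in ALLOWED_TYPES.get(rel, set())
    }
    return max(valid, key=valid.get) if valid else None

if __name__ == "__main__":
    # The model slightly prefers "kill", but the object is an ORG.
    model_scores = {"work for": 0.4, "kill": 0.6}
    print(pick_relation(model_scores, "PERSON", "ORG"))
```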
