Skip to content

Implementation of the EMNLP'24 paper: Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

License

Notifications You must be signed in to change notification settings

amazon-science/SigExt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

SigExt: Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

This is the implementation of the EMNLP'24 paper.

Title: Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

Authors: Lei Xu, Asad Karim, Saket Dingliwal, Aparna Elangovan

Introduction

Large language models (LLMs) are highly effective at generating summaries across various domains through prompting techniques, reducing the need for dedicated training in summarization applications. However, designing prompts that guide LLMs to generate summaries with an appropriate level of detail and a coherent writing style can be challenging. Keyphrase Signal Extractor (SigExt) addresses this by leveraging salient information directly from the source document to improve summarization prompts. By integrating extracted keyphrases, SigExt enhances ROUGE F1 and recall, making generated summaries more aligned with reference texts and more complete. Additionally, the number of keyphrases provides a precision-recall trade-off, allowing for tailored summarization outputs.

Run Experiments

Here is an example on running SigExt on CNN dataset.

# Prepare datasets in jsonl format
python3 src/prepare_data.py --dataset cnn --output_dir experiments/cnn_dataset/

# Train the longformer keyphrase extractor
python3 src/train_longformer_extractor_context.py \
  --dataset_dir experiments/cnn_dataset/ \
  --checkpoint_dir experiments/cnn_extractor_model/

# Inference the longformer keyphrase extractor
python3 src/inference_longformer_extractor.py \
  --dataset_dir experiments/cnn_dataset/ \
  --checkpoint_dir experiments/cnn_extractor_model/ \
  --output_dir experiments/cnn_dataset_with_keyphrase/

# Run summarization
python3 src/zs_summarization.py \
  --model_name claude \
  --kw_strategy sigext_topk \
  --kw_model_top_k 15 \
  --dataset cnn \
  --dataset_dir experiments/cnn_dataset_with_keyphrase/ \
  --output_dir experiments/cnn_extsig_predictions/

Citation

@inproceedings{xu2024salient,
  title={Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization},
  author={Xu, Lei and Karim, Mohammed Asad and Dingliwal, Saket and Elangovan, Aparna},
  booktitle = "Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track",
  year={2024}
}

Security

See CONTRIBUTING for more information.

License

This project is licensed under the Apache-2.0 License.

About

Implementation of the EMNLP'24 paper: Salient Information Prompting to Steer Content in Prompt-based Abstractive Summarization

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages