Skip to content

Instantly share code, notes, and snippets.

Show Gist options
  • Select an option

  • Save hugobowne/a6ded130a0f0340e432fc8e9f32ec910 to your computer and use it in GitHub Desktop.

Select an option

Save hugobowne/a6ded130a0f0340e432fc8e9f32ec910 to your computer and use it in GitHub Desktop.
# An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C)
This paper addresses the resistance in clinical research to adopt NLP models due to transparency and usability issues. It proposes an open NLP framework demonstrated via the National COVID Cohort Collaborative (N3C) using COVID-19 clinical notes. The framework includes open data annotation, a community-driven ruleset platform, and synthetic text data generation. Evaluations on datasets from multiple institutions show promising results for multi-institution clinical NLP study and adoption.
# A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text
This review paper covers deep learning methods for generating Java code from natural language text, highlighting techniques ranging from RNN to Transformer-based models. It categorizes models into encoder-only, decoder-only, and encoder-decoder types, discusses datasets, evaluation metrics, and points to future research directions.
# Towards the Study of Morphological Processing of the Tangkhul Language
This work initiates morphological processing for the Tangkhul language using an unsupervised approach. Despite limited data, the morpheme identification using Morphessor shows reasonable output, proving the approach's viability in low-resource language NLP.
# An Automated Multiple-Choice Question Generation Using Natural Language Processing Techniques
This paper presents an NLP system for automatic multiple-choice question generation (MCQG) targeted at computer-based testing. It extracts important keywords from lesson materials to generate relevant questions. The system's effectiveness is validated on multiple lesson materials, showing capability in keyword extraction and question setting.
# SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks
Explores the novel use of prompting with pre-trained speech language models to perform various speech processing tasks unified under a speech-to-unit generation framework. The study shows competitive results with fine-tuning methods and highlights the framework's potential, especially in few-shot learning scenarios and future advanced speech LMs.
# Summary
These papers reflect diverse and advanced natural language processing research including clinical NLP frameworks, code generation from natural language, low-resource language morphology, automated educational question generation, and innovative prompting for speech models, indicating broad and impactful directions in the field.
---
### Papers:
1. https://arxiv.org/abs/2110.10780v3
2. https://arxiv.org/abs/2306.06371v1
3. https://arxiv.org/abs/2006.16212v1
4. https://arxiv.org/abs/2103.14757v1
5. https://arxiv.org/abs/2408.13040v1
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment