Created
November 18, 2025 05:54
-
-
Save hugobowne/a6ded130a0f0340e432fc8e9f32ec910 to your computer and use it in GitHub Desktop.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # An Open Natural Language Processing Development Framework for EHR-based Clinical Research: A case demonstration using the National COVID Cohort Collaborative (N3C) | |
| This paper addresses the resistance in clinical research to adopt NLP models due to transparency and usability issues. It proposes an open NLP framework demonstrated via the National COVID Cohort Collaborative (N3C) using COVID-19 clinical notes. The framework includes open data annotation, a community-driven ruleset platform, and synthetic text data generation. Evaluations on datasets from multiple institutions show promising results for multi-institution clinical NLP study and adoption. | |
| # A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text | |
| This review paper covers deep learning methods for generating Java code from natural language text, highlighting techniques ranging from RNN to Transformer-based models. It categorizes models into encoder-only, decoder-only, and encoder-decoder types, discusses datasets, evaluation metrics, and points to future research directions. | |
| # Towards the Study of Morphological Processing of the Tangkhul Language | |
| This work initiates morphological processing for the Tangkhul language using an unsupervised approach. Despite limited data, the morpheme identification using Morphessor shows reasonable output, proving the approach's viability in low-resource language NLP. | |
| # An Automated Multiple-Choice Question Generation Using Natural Language Processing Techniques | |
| This paper presents an NLP system for automatic multiple-choice question generation (MCQG) targeted at computer-based testing. It extracts important keywords from lesson materials to generate relevant questions. The system's effectiveness is validated on multiple lesson materials, showing capability in keyword extraction and question setting. | |
| # SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks | |
| Explores the novel use of prompting with pre-trained speech language models to perform various speech processing tasks unified under a speech-to-unit generation framework. The study shows competitive results with fine-tuning methods and highlights the framework's potential, especially in few-shot learning scenarios and future advanced speech LMs. | |
| # Summary | |
| These papers reflect diverse and advanced natural language processing research including clinical NLP frameworks, code generation from natural language, low-resource language morphology, automated educational question generation, and innovative prompting for speech models, indicating broad and impactful directions in the field. | |
| --- | |
| ### Papers: | |
| 1. https://arxiv.org/abs/2110.10780v3 | |
| 2. https://arxiv.org/abs/2306.06371v1 | |
| 3. https://arxiv.org/abs/2006.16212v1 | |
| 4. https://arxiv.org/abs/2103.14757v1 | |
| 5. https://arxiv.org/abs/2408.13040v1 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment