My MSc research workspace
This page is for showcasing my MSc research project on Native Language Identification (NLI) Task.
You can find more by clicking each tab or vising my research blog at: https://research.faruk.me.
Proposal Link: MSc Research Proposal
- Topic: Native Language Identification (NLI)
- Angle: LLMs, fairness, robustness, confound control
- Datasets: Dataset link will be available soon...
- RQ1 — coming soon
- RQ2 — coming soon
- RQ3 — coming soon
Research timeline expands to 16 weeks as per University regulation. Starting from Jan, 2026 ~ Apr, 2026. I splitted these weeks into 6 sprints as bellow -
- MSRF-1# — 2 Weeks
- MSRF-2# — 2 Weeks
- MSRF-3# — 3 Weeks
- MSRF-4# — 3 Weeks
- MSRF-5# — 2 Weeks
- MSRF-6# — 3 Weeks
- Setup & Data Pipeline
- Baseline & Zero Shot Models
- Debiased & Hybrid models
- Openset Recognition
- Evaluation & Synthesis
- Dissertation writing & Review
Curated literature archive for NLI research-
Open the archive page here:
Open literature archiveThe literature archive lists past research works chronologically based on publishing year & technology shifts.
- Classic ML baselines (SVM, char n-grams)
- Syntactic transfer features
- Topic bias / confounds
- Neural & transformer era
- LLMs: zero-shot, robustness, ethics
All publications related to my MSc research will be listed here-
- Review paper — (link)
- Methodology write-up — (link)
- Results & discussion — (link)
- Target venue — (status)
- Backup venue — (status)
- Presentation materials — (status)
Quick scratchpad for ideas, citations to add, experiment reminders, etc.