Welcome! I am an Assistant Professor of Data Science at the New York Institute of Technology (NYIT), Vancouver campus, and an Adjunct Professor in the Department of Mathematics & Statistics at the University of Victoria (UVic), British Columbia. I earned a BSc (Honours) in Statistics from the University of Sri Jayewardenepura, Sri Lanka, and an MSc in Statistics from the University of Manitoba (July 2021). Guided by Prof. Saman Muthukumarana and Dr. Mike Domaratzki, my doctoral research developed new methods for addressing class imbalance in classification tasks, with an emphasis on accuracy and interpretability; related work has appeared in the Journal of Machine Learning with Applications and PeerJ Computer Science. I also completed a postdoctoral fellowship at the University of Manitoba with Prof. Muthukumarana, Dr. Domaratzki, and Dr. Max Turgeon.
With over nine years of combined academic, industry and government experience, I have worked at the intersection of statistics, machine learning, and real-world applications, delivering impactful solutions in public health, education, and beyond. My passion lies in developing methods that are not only powerful, but also interpretable and accessible, enabling data-driven decision-making for meaningful change.
Last updated: November 09, 2025
Matharaarachchi S., Turgeon M., Domaratzki M., Muthukumarana S. (2024). “Sequential Bayesian Estimation of the F1 Score Using the Dirichlet-Multinomial Model." International Journal of Data Science and Analytics.
Katz, A., Ekuma, O., Enns, J., Cavett, T., Singer, A., Sanchez-Ramirez, D., Keynan, Y., Lix, Y., Walld, R., Yogendran, M., Nickel, N., Urquia, M., Star, L., Olafson, K., Logsetty, S., Spiwak, R., Waruk, J., Matharaarachichi, S. (2025) “Identifying people with post-COVID condition using linked, population-based administrative health data from Manitoba, Canada: prevalence and predictors in a cohort of COVID-positive individuals.” BMJ open, 15 (1), e087920.
Matharaarachchi S., Domaratzki M., Muthukumarana S. (2024). “Enhancing SMOTE for Imbalanced Data with Abnormal Minority Instances.” Machine Learning with Applications.
Matharaarachchi S., Domaratzki M., Katz A., Muthukumarana S. (2022). “Discovering Long COVID Symptom Patterns: Association Rule Mining and Sentiment Analysis in Social Media Tweets.” JMIR Form Res.
Matharaarachchi S., Domaratzki M., Marasinghe C., Muthukumarana S., and Tennakoon V. (2022). “Modeling and Feature Assessment of the Sleep Quality among Chronic Kidney Disease Patients.” Sleep Epidemiology.
Matharaarachchi S., Domaratzki M., Muthukumarana S. (2022). “Minimizing features while maintaining performance in data classification problems.” PeerJ Computer Science 8:e1081.
Enns, J., Katz, A., Yogendran, M., Urquia, M., Muthukumarana S., Matharaarachchi, S., Singer, A., Nickel, N., Star, L., Cavett, T., Keynan, Y., Lix, L. and Sanchez-Ramirez, D. (2022) “A population data-driven approach to identifying ‘Long COVID’ cases in support of diagnosis and treatment.” International Journal of Population Data Science, 7(3).
Matharaarachchi, S., M. Domaratzki, and S. Muthukumarana (2021). “Assessing feature selection method performance with class imbalance data.” Machine Learning with Applications. This paper was awarded with the Reproducibility Badge Initiative (RBI).
Matharaarachchi S., M. Domaratzki, A. Katz, S. Muthukumarana. (2024). “Long COVID Prediction in Manitoba Using Clinical Notes Data: A Machine Learning Approach.”
Matharaarachchi S., M. Domaratzki, S. Muthukumarana. (2024). “Deep-ExtSMOTE: Integrating Autoencoders for Advanced Mitigation of Class Imbalance in High-Dimensional Data Classification.” Journal of Big Data Research.
GPA: 4.13/4.5
Thesis: New Developments for Addressing Class Imbalance Issue in Classification Tasks.
GPA: 3.8/4.0
First two years included coursework in Mathematics, Computer Science, and Statistics.
Dissertation: Study on Parliamentary General Electoral Systems in Sri Lanka.
Fall 2025
Fall 2025
Summer 2022
| Year | Course | Term |
|---|---|---|
| 2022 | STAT 4150 - Bayesian Analysis and Computing | Fall 2022 |
| 2022 | STAT 7270 - Bayesian Inference | Fall 2022 |
| 2022 | STAT 2000 - Basic Statistical Analysis 2 (n=2) | Winter 2022 |
| 2021 | STAT 4150 - Bayesian Analysis and Computing | Fall 2021 |
| 2022 | STAT 7270 - Bayesian Inference | Fall 2021 |
| 2021 | STAT 2000 - Basic Statistical Analysis 2 (n=2) | Fall 2021 |
| 2021 | STAT 2000 - Basic Statistical Analysis 2 | Summer 2021 |
| 2021 | STAT 2000 - Basic Statistical Analysis 2 (n=2) | Winter 2021 |
| 2020 | STAT 2000 - Basic Statistical Analysis 2 (n=2) | Fall 2020 |
| 2020 | STAT 4150 - Bayesian Analysis and Computing | Winter 2020 |
| 2022 | STAT 7270 - Bayesian Inference | Winter 2020 |
| 2020 | STAT 1000 - Basic Statistical Analysis 1 (n=2) | Winter 2020 |
| 2019 | STAT 1000 - Basic Statistical Analysis 1 | Fall 2019 |
Note: n = Number of sections conducted for the same course.
Cybersecurity Day 2025, New York Institute of Technology, Vancouver Campus.
65th ISI World Statistics Congress 2025, The Hague, Netherlands
Statistical Society of Canada (SSC) Annual Meeting 2025, Saskatchewan, Canada
International Statistics Conference 2024 (ISC2024), Colombo, Sri Lanka.
International Statistics Conference 2024 (ISC2024), Colombo, Sri Lanka.
PhD Theis Defense, Department of Statistics, Faculty of Graduate Studies, University of Manitoba.
Faculty of Graduate Studies, University of Manitoba.
4th International Conference on Future of Preventive Medicine & Public Health (Future of PMPH 2024).
2024 WNAR/IMS/Graybill Annual Meeting, Fort Collins, Colorado - Student Paper Competition presentation.
CANSSI Show Case 2023
Data to Action Day 2023, organized by the Data Science Program, Government of Manitoba.
Statistical Society of Canada (SSC) Annual Meeting 2022.
Joint Statistical Meetings (JSM) 2021.
MSc Theis Defense, Department of Statistics, Faculty of Graduate Studies, University of Manitoba.
My research spans machine learning, statistical learning, and computational statistics, with a focus on high-dimensional data analysis, classification, and algorithmic approaches. I develop Bayesian methods and deep-learning techniques—together with principled feature engineering, knowledge representation, and resampling—to handle class imbalance, anomaly detection, and outlier-robust modeling. I also work in NLP and sequential settings, building methods that remain interpretable and reliable under distributional shift. Much of my work targets healthcare and education, where I aim to bridge theory and practice by creating efficient, transparent models that deliver actionable insights in real-world deployments.