Software & Datasets
Open-source tools, reproducibility packages, and datasets
MANARA Lab — Current Releases
Current reproducibility materials for MANARA Lab publications are maintained at github.com/MANARA-Lab-UM6P. Each repo includes the paper reference, reproducible experiments, datasets or links to data, and an open-source license.
- TCEF — Reproducibility files for: "Robust Feature Extraction Using Temporal Context Averaging for Speaker Identification in Diverse Acoustic Environments," IEEE Access, Jan. 2024. github.com/MANARA-Lab-UM6P/TCEF
- CAT-Net — Reproducibility files for: "CAT-Net: A Channel and Self-Attention TCN for Robust Frame-Level Overlapping Speech Detection," IEEE TASLP, 2026. github.com/MANARA-Lab-UM6P/CAT-Net
- Synthetic-Granular-Gen — Reproducibility files for: "Instant Particle Size Distribution Measurement Using CNNs Trained on Synthetic Data," CVPR Synthetic Data Workshop, 2025. github.com/MANARA-Lab-UM6P/Synthetic-Granular-Gen
- WFIV — Reproducibility files for: "An Efficient Task Allocation in Mobile Crowdsensing Environments," IEEE TNSM, 2025. github.com/MANARA-Lab-UM6P/WFIV
- PLTA — Reproducibility files for: "PLTA: Private Location Task Allocation using Multidimensional Approximate Agreement," IEEE CNS, 2024. github.com/MANARA-Lab-UM6P/PLTA
Earlier Releases
- Approximate Randomization Tool — Statistical significance testing tool used in crowdsensing research. github.com/Abdeddine/Approximated_randomization_test
- Speech Segregation (IBM-based) — A learned approach for speech segregation. DOI: 10.24433/CO.2877154.v1. codeocean.com/capsule/1202410/tree
- KIT-30 Dataset — A dataset with 30 authors for evaluating author identification models against the Emirati tweets domain, with additional languages for cross-lingual comparisons. gitlab.com/mmaakh/kit-30
- Fextractor — Feature extraction library for stylometry research. gitlab.com/mmaakh/fextractor
- Stylometry Survey Evaluation Code — Evaluation code for: "Evaluating Author Attribution on Emirati Tweets," IEEE Access, vol. 8, August 2020. gitlab.com/mmaakh/stylometry-survey-evaluation
- Phishing Classification Study — Reproducibility files for: "A Study of Feature Subset Evaluators and Feature Subset Searching Methods for Phishing Classification." khonji.org/phishing_studies.html
- Artest — Statistical significance tester for Area Under Curve (AUC) using Approximate Randomization. Released under GPLv2. github.com/mmaakh/artest
- KECI Action Recognition Skeletal Dataset — Skeletal dataset from two views for human action recognition research. kecidev.kaist.ac.kr/KECI_TwoViews_Skeletal_Dataset.html
- WUCLP — Waterloo User Controlled Lightpaths — A distributed application for managing lightpaths via web-based or GRID interface. A live version ran on CANARIE's Canada-wide optical network. uclp.uwaterloo.ca