About
Software engineer with 4+ years of experience across big data, machine learning research/operations and information retrieval (search and ranking). I operate at the intersection of distributed systems and machine learning and aim to solve problems at scale.
Currently, I spend much of my time pre-training, adapting, and fine-tuning language models for specific tasks or languages. Before this, I helped enterprises build and manage machine learning systems in on-premises and hybrid cloud environments.
Conference publications at EMNLP, SIGIR, and ACL covering large language models, multilingual NLP, and low-resource information retrieval. See Google Scholar.
Contact
Work Experience
NanoDegrees covered:
- Machine Learning Engineer ND
- Machine Learning Engineer with Azure ND
- AWS Machine Learning Engineer ND
- Machine Learning DevOps ND
- Deep Learning ND
- Cloud Developer ND
Tools:
- Azure Machine Learning
- Python / Java
- PySpark
- Compression of large language models through distillation and pruning.
Tools:
- Azure Machine Learning
- Python
- SQL
- Spark MLlib
Projects
The Cross-Lingual Information Retrieval for African Languages (CIRAL) track was first hosted at the Forum for Information Retrieval Evaluation (FIRE) 2023.
Side Projects
Education
Courses:
- Distributed Systems
- Algorithm Design & Analysis
- Requirements Engineering
- Advanced Topics in Human-Computer Interaction: InfoVis for AI Explainability
- Software Engineering for Big Data & AI
Thesis:
Scaling Pre-training Data and Language Models for African Languages
Papers:
- Innovating in the Financial Industry: A Requirement Engineering Perspective
- Distributed Systems for Scaling Foundation Models: A Critical Evaluation
- Hope '23: Understanding Twitter Trends Behind Nigeria's Elections