Hi, I'm

Bryium Onyancha

AI Researcher & Engineer

I develop scalable AI and machine learning systems, from transformer fine-tuning and model optimization to production-ready ML applications with robust backend APIs and responsive web interfaces.

About Me

I am a research-focused Master's student in Computer Science with hands-on experience in machine learning, including transformer models, reinforcement learning, recursive language knowledge, and scalable GPU-accelerated ML pipelines.

Alongside my research work, I am a software engineer skilled in building modern web applications, robust backend APIs, and mobile applications using contemporary frameworks and tools.

My research focus areas include Large Language Models, Diffusion Models, Multimodal Systems, Recursive Language Knowledge, and Computer Vision — with a long-term goal of developing scalable, compute-efficient architectures for next-generation LLMs and multimodal agents.

I conduct research on efficient training, adaptive inference, and reinforcement-learning–based reasoning in modern foundation models. I also focus on post-training techniques such as RLHF, RLAIF, and iterative refinement for alignment, controllability, and deep reasoning. Additionally, I explore model compression, architecture search, and training-time optimizations to reduce compute while preserving frontier-level performance.

My methods are applied to computer vision tasks like object recognition, image generation, and scene understanding, as well as predictive modeling problems in structured or sequential data.

Skills

ML & AI

  • Deep Learning & Neural Networks
  • Transformer Models & LLMs
  • Reinforcement Learning (RLHF, RLAIF)
  • Diffusion Models
  • Computer Vision (OpenCV, MediaPipe)
  • Model Compression & Optimization
  • RAG Systems

Languages

  • Python (Django, Flask)
  • JavaScript / TypeScript
  • Java & Kotlin
  • Go & C++
  • SQL

Web & Mobile

  • React / Next.js
  • React Native / Expo
  • Node.js
  • RESTful API Design
  • Auth (JWT, NextAuth, OAuth)
  • LangChain.js

Tools & Infra

  • Git & GitHub
  • Docker
  • PostgreSQL / MySQL / SQLite
  • Prisma & Django ORM
  • Cloud Deployment & Scaling
  • GPU-Accelerated Training

Experience

LLM Trainer

Turing · Remote Jul 2025 – Dec 2025
  • Contributed to post-training and fine-tuning of large language models, focusing on improving reasoning quality, alignment, and response consistency.
  • Worked with reinforcement learning environments (RL Gym-style setups) to evaluate and refine model behavior during training and inference.
  • Supported API development and tooling for model evaluation, data curation, and iterative refinement workflows.
  • Collaborated remotely with cross-functional teams to analyze model outputs, debug failure cases, and improve training pipelines.

Software Engineer

DukaTech Solutions · Nairobi Oct 2024 – May 2025
  • Contributing to developing innovative applications at DukaTech Solutions.
  • Involved in the development of Shop Okoa and Mamapesa, a savings and loan platform.
  • Integrated machine learning models into Mamapesa to predict loan eligibility.
  • Developed an AI chatbot to simplify loan applications and educate users on financial concepts.

Projects

ML / NLP

Bryium AI

Intelligent RAG-powered chatbot built with Flask and LangChain.js for real-time, context-aware conversations. Integrated Gemini API for dynamic AI responses with external knowledge retrieval.

FlaskLangChain.jsRAGGemini API
View on GitHub →
Full-Stack / AI

HURA Chatbot

Conversational web application delivering AI-driven replies and real-time weather forecasts based on user location, with a responsive interface for seamless UX.

JavaScriptAIWeather APICSS
View on GitHub →
Computer Vision

Object Detection System

Object detection using pretrained YOLOv5 & Faster R-CNN models. Fine-tuned on custom datasets with bounding box visualization and mAP evaluation metrics.

YOLOv5Faster R-CNNPythonPyTorch
View on GitHub →
Time Series

Energy Consumption Forecasting

Time-series forecasting system predicting energy consumption trends using statistical and deep learning models (LSTM) with rolling-window validation.

PyTorchLSTMPandasStatsmodels
View on GitHub →
NLP / Research

Transformer Text Classification

Rigorous evaluation of BERT & DistilBERT for text classification with ablation studies, class-wise metrics, confusion matrices, and structured error analysis.

BERTDistilBERTHuggingFacePython
View on GitHub →
Deep Learning

CNN Image Classifier

CNN-based image classifier for handwritten digit recognition (MNIST) with data augmentation techniques and comprehensive error analysis visualizations.

CNNMNISTPythonTensorFlow
View on GitHub →

Education

Master's Degree

William Jessup University

San Jose, CA

Expected 2027

Research Focus: Large Language Models, Diffusion Models, Multimodal Systems & Computer Vision

Bachelor of Science, IT

Dedan Kimathi University of Technology

Nyeri, Kenya

Graduated 2024

Second Class Honors · Software Engineering, Networking & Machine Learning

Get In Touch

I'm always open to discussing new projects, research collaborations, or opportunities. Feel free to reach out!

Email: bryiumonyancha@gmail.com

Phone: +1 (425) 534-9730

Location: Lynnwood, WA, USA