Hi, I'm
I develop scalable AI and machine learning systems, from transformer fine-tuning and model optimization to production-ready ML applications with robust backend APIs and responsive web interfaces.
I am a research-focused Master's student in Computer Science with hands-on experience in machine learning, including transformer models, reinforcement learning, recursive language knowledge, and scalable GPU-accelerated ML pipelines.
Alongside my research work, I am a software engineer skilled in building modern web applications, robust backend APIs, and mobile applications using contemporary frameworks and tools.
My research focus areas include Large Language Models, Diffusion Models, Multimodal Systems, Recursive Language Knowledge, and Computer Vision — with a long-term goal of developing scalable, compute-efficient architectures for next-generation LLMs and multimodal agents.
I conduct research on efficient training, adaptive inference, and reinforcement-learning–based reasoning in modern foundation models. I also focus on post-training techniques such as RLHF, RLAIF, and iterative refinement for alignment, controllability, and deep reasoning. Additionally, I explore model compression, architecture search, and training-time optimizations to reduce compute while preserving frontier-level performance.
My methods are applied to computer vision tasks like object recognition, image generation, and scene understanding, as well as predictive modeling problems in structured or sequential data.
Intelligent RAG-powered chatbot built with Flask and LangChain.js for real-time, context-aware conversations. Integrated Gemini API for dynamic AI responses with external knowledge retrieval.
Conversational web application delivering AI-driven replies and real-time weather forecasts based on user location, with a responsive interface for seamless UX.
Object detection using pretrained YOLOv5 & Faster R-CNN models. Fine-tuned on custom datasets with bounding box visualization and mAP evaluation metrics.
Time-series forecasting system predicting energy consumption trends using statistical and deep learning models (LSTM) with rolling-window validation.
Rigorous evaluation of BERT & DistilBERT for text classification with ablation studies, class-wise metrics, confusion matrices, and structured error analysis.
CNN-based image classifier for handwritten digit recognition (MNIST) with data augmentation techniques and comprehensive error analysis visualizations.
San Jose, CA
Expected 2027
Research Focus: Large Language Models, Diffusion Models, Multimodal Systems & Computer Vision
Nyeri, Kenya
Graduated 2024
Second Class Honors · Software Engineering, Networking & Machine Learning
I'm always open to discussing new projects, research collaborations, or opportunities. Feel free to reach out!
Email: bryiumonyancha@gmail.com
Phone: +1 (425) 534-9730
Location: Lynnwood, WA, USA