AI Scientist 2023-Present
AI Scientist at Vooban, specializing in applied AI consulting. Responsible for end-to-end delivery of AI solutions. Includes feasibility studies, solution design, development and production deployment. Areas of work cover computer vision, audio processing, LLM/VLMs, NLP, generative AI, and recommender systems, with a strong emphasis on building client-ready products.
Key Contributions
- AI development: Built and deployed models for object detection/segmentation, text-voice alignment, voice isolation, LLMs/VLMs, OCR, and recommender systems.
- Full-stack AI pipelines: Designed workflows covering data collection, annotation, preprocessing, training, evaluation, and deployment.
- Architecture & planning: Defined project architectures, estimated resources (Jira), and led feasibility assessments with clients.
- Production integration: Integrated research prototypes into production codebases, wrote documentation and technical guides.
- Training & outreach: Delivered workshops and conferences on Ethical & Responsible AI for diverse audiences (entrepreneurs, developers, marketing teams).
Example projects
- Detection assistance for dubbing projects: Using a mix of audio processing, voice recognition, computer vision and traditional phoneme theory, detect several relevant elements of movies to assist with automated dubbing workflows, reducing manual editing time by 50%.
- Automatic takeoff of steel structures from architectural drawings: Using computer vision and OCR techniques to automatically extract measurements and quantities from blueprints.
- Donator probability estimation: Using traditional machine learning and statistical models to predict donation likelihood from client data.
- Automatic countertop submission form extraction: Using computer vision, OCR and form parsing techniques to automatically extract data from submission forms, accelerating data entry.
- Automatic X-Ray auto part defect detection: Using object detection models to identify defects in automotive parts from X-ray images. Focusing on optimizing recall to reduce false negatives in quality control.
Tools & Technologies
AI-related: Python, PyTorch, Hugging Face, scikit-learn, OpenCV, pydantic-ai, LLM/VLM, Roboflow, MLFlow, Windsurf & other coding assistants, prompt engineering.
General programming: Git, Docker, MacOS.
Generic skills: project scoping, client communication, technical writing, workshop facilitation, multi-disciplinary collaboration, agile development, version control, re-inventing programming with agents in the loop.