Hello, I'm

Sebastian Gomez

I'm an ML and software engineer currently finishing an MSc in Biomedical Engineering with a research focus on ML applications. I'm based in Bogotá, Colombia, and have extensive experience working with multicultural, remote teams. My personal and professional interests include developing agentic AI workflows, NLP solutions, and ML models using LangChain, PyTorch, and TensorFlow. With a proven background developing impactful technical solutions at YC-backed startups, I'm eager to bring my abilities to new venture-backed, innovative companies.

Let's get in touch

Machine Learning Intern (Gen AI)

July 2025 - Present

Provectus | San Francisco, California - Remote

ML internship at Provectus, focusing on Generative AI. I am currently building AI features that use agentic workflows for intelligent data parsing and processing.

    • Optimizing and benchmarking state-of-the-art Large Language Models (LLMs).
    • Evaluating and enhancing ML pipelines, encompassing data preprocessing, model training, and rigorous performance analysis.
    • Fine-tuning expert models to use in agentic AI workflows.
    • Leveraging cloud-based ML services (AWS/GCP) to deploy GenAI solutions into production environments.
    • Staying on top of the rapidly evolving GenAI landscape through continuous research.

Languages and Tools: Python, AWS, AWS Bedrock, AWS SageMaker, MLflow, LiteLLM, Instructor

Growth and Software Development Engineer

July 2023 - May 2025

JustPaid | San Francisco, California - Remote

Full-stack development of internal features across the backend (Django) and the frontend. I developed flag-based features that give users access to more capabilities inside the platform. I enhanced the authentication workflow and added user sessions that store the last connection and the last connected account. I also developed a suite that lets founders control each user's access to app information at every level.

    • Built AI features leveraging agentic workflows for intelligent data parsing, processing, and structured database entry.
    • Designed and implemented an agentic pipeline for contract parsing from raw PDF inputs.
    • Implemented a check that assesses whether input documents contain sufficient data to generate a valid legal contract.
    • Used LlamaIndex to extract relevant text content.
    • Parsed extracted data into Pydantic models for Customer, Contract, and LineItem entities.
    • Integrated a self-validation loop and agent-based decision system to ensure extraction accuracy and data integrity.
    • Automated the creation and storage of validated records in a PostgreSQL database via the Django ORM.

Languages and Tools: Node.js, Django, LangChain, LangGraph, LlamaIndex

Contact: Daniel Kivatinos, daniel@kivatinos.com

ML Graduate Teaching Assistant

August 2023 - May 2025

Universidad de los Andes | Bogotá, Colombia

Conducted laboratory sessions on machine learning fundamentals, covering optimization techniques, linear and logistic regression models, analytical solutions for Ordinary Least Squares (OLS), hyperparameter tuning methodologies, and neural network architectures.

    • Managed approximately 70 students per semester. Responsible for creating laboratory resources and assessing student work.
    • Rating given by students: 4.90/5.00.

Languages and Tools: Node.js, Django, LangChain, LangGraph, LlamaIndex

Contact: Luis Felipe Giraldo Trujillo, lf.giraldo404@uniandes.edu.co

Consulting and Information Solutions Intern

January 2023 - June 2023

Roche | Bogotá, Colombia

Analytical and detail-oriented Information Solutions Intern with hands-on experience at Roche Colombia, specializing in healthcare data analysis and clinical process visualization.

    • Implemented an automated report and chart generator for statistical information about patients with thyroid disease.
    • Translated hospital workflows for patient exams (pathology and blood) into easy-to-understand block diagrams.

Languages and Tools: Node.js, Django, LangChain, LangGraph, LlamaIndex

Featured Projects

Multimodal Collection and Analysis of Colombian Sign Language

This research proposes the use of multimodal data combined with artificial intelligence algorithms to identify key formational parameters of Colombian Sign Language (LSC) and differentiate communicative characteristics between deaf people and interpreters.

Python, TensorFlow, Hugging Face, ViViT

Development of a Machine Learning Algorithm to Support the Diagnosis of Urological Diseases at Fundación Santa Fe de Bogotá

This project, in collaboration with Fundación Santa Fe de Bogotá, uses artificial intelligence to improve accuracy and consistency in the diagnosis of urological diseases.

Python, Scikit-learn, Pandas, NumPy

Personal Portfolio

This portfolio showcases my professional journey, including my projects, skills, and experiences in the field of biomedical engineering and artificial intelligence.

Next.js, React, Tailwind CSS, TypeScript