Nevyn Duarte | Data Scientist & AI Engineer

Building intelligent systems for real-world impact

I'm Nevyn Duarte. I engineer large-scale AI platforms and machine learning systems. At Bridges AI Consulting, I build end-to-end solutions for content understanding and automated decision-making while completing my Master of Science in Data Science at CU Boulder.

Bridges AI Consulting

Co-Founder, CTO & Lead AI Engineering Consultant

Dec 2025 – Present

Led AI consulting engagements across concurrent product builds, from client-facing technical scoping and architecture design through production deployment
Built GPT-4o and Anthropic Claude-powered content generation pipelines with multi-step prompt chaining, agentic workflows, and automated publishing across decoupled FastAPI and Next.js services
Developed ML platforms across 3M+ records using LightGBM/XGBoost valuation models, YOLOv11 + CLIP computer vision, and spatial clustering, with training on a local RTX 3090 via PyTorch and CUDA
Deployed 13-service backends on AWS ECS Fargate with PostgreSQL/PostGIS and Cloudflare Workers edge deployments; implemented compliance-aware multi-tenant data controls
Designed scalable ingestion and transformation layers for unstructured data streams, extracting structured signals for downstream analytics and automation

Machine Learning, AI Systems, Data Engineering, Python, NLP, Platform Design, Scalable Systems

M Science (Jefferies)

Quantitative Equity Research Associate

Sep 2022 – Feb 2023

Developed predictive equity models using PySpark and SQL on Databricks, analyzing millions of transaction and job-posting records
Produced 10+ data-driven research reports with FactSet and REST APIs, accelerating report cycles from quarterly to monthly
Automated data extraction and reporting workflows, reducing reporting effort by 20%
Collaborated with senior analysts to deliver alternative-data insights for institutional investors

PySpark, Databricks, Python, SQL, FactSet API

Goodfill

Backend Software Engineer Consultant

Jul 2022 – Sep 2022

Developed investor onboarding and FINRA record verification systems using Python and the FINRA API
Engineered backend services in Go with AWS Lambda/SQS/SNS architecture
Integrated trading applications with Interactive Brokers Gateway & TWS APIs
Deployed containerized infrastructure with Docker to improve scalability and reliability

Go, AWS Lambda, Docker, IB Gateway

Perceive Now

Data Analyst

Jul 2022 – Oct 2022

Implemented HuggingFace NLP models on AWS Lambda for zero-shot classification, summarization, and entity extraction to process 50k+ research articles
Streamlined REST API pipelines in Python and Jupyter, cutting report generation time from minutes to seconds
Enhanced large-scale text processing pipelines, reducing operating costs by 12%

HuggingFace, NLP, AWS Lambda, Python

Citco (Citco Fund Services)

Risk Analysis Intern

May 2022 – Jul 2022

Automated AIFMD and Form PF reporting for 20+ hedge funds using Python, VBA, SQL, and PySpark
Supported internal risk and valuation models for portfolios exceeding $10B AUM across multiple fund structures
Developed Excel analytics dashboards to improve fund performance reporting

PySpark, Risk Modeling, SQL, VBA

AMD (Advanced Micro Devices)

Aug 2021 – Jun 2022

Yield Analysis Intern

Designed an adaptive outlier detection algorithm using nearest-neighbor ML to flag yield anomalies across 100k+ wafer samples
Built interactive Power BI dashboards adopted by 20+ engineers to track production test results and yield analysis
Visualized trends in JMP and Python to improve yield analysis accuracy
Automated failure alerts with pandas/SciPy/Power Automate for real-time issue resolution

Product Development Intern

Built real-time Power BI dashboards to monitor lab testing machine status and work orders
Integrated 12+ data sources, including Snowflake and MySQL, using Power Automate and Beautiful Soup
Automated reporting and anomaly detection workflows in Python to identify machine failures
Created shipping and inventory tracking workflows that improved operational efficiency by ~15%

Machine Learning, Python, JMP, Power BI, Snowflake, Automation, MySQL

BNY (The Bank of New York Mellon Corp)

Summer Data Analytics Analyst

Jun 2021 – Aug 2021

Designed Tableau, Excel, and Python dashboards for liquidity monitoring, client data visualization, and analytics across Sales and Analytics teams
Reduced reconciliation time by 20% for 3+ business units
Developed automation scripts in VBA and Python to reduce manual reporting time
Contributed to a centralized data warehouse to improve data accuracy across teams
Partnered with cross-functional teams to streamline client liquidity and performance analysis

Tableau, Python, Data Analytics, VBA

UT Austin BWI Project

Undergraduate Research Associate

Jan 2019 – Jun 2020

Developed trajectory-based person-following systems using DeepSORT tracking and a triplet loss function (as popularized by Google's FaceNet)
Programmed robot navigation in Python, C++, and ROS for autonomous operation
Co-authored a research paper and received the UT CNS Award for Excellence in Computer Science

Computer Vision, Python, C++, ROS, DeepSORT

Order.co (formerly Negotiatus)

Software Engineering Intern

Jun 2017 – Aug 2017

Built interactive D3.js and AJAX dashboards for real-time sales data visualization
Migrated CRM data from HubSpot to Salesforce through API integrations and automated scripts
Enhanced front-end features with HTML, CSS, and JavaScript, and wrote RSpec tests for the Ruby on Rails application

D3.js, Ruby on Rails, JavaScript, Salesforce API

stae

Software Engineering Intern

Jun 2016 – Sep 2016

Designed interactive web dashboards with PostgreSQL, React, D3.js, and Node.js for client data visualization
Redesigned company web pages to align with updated product interfaces and branding
Authored comprehensive developer documentation on Linux systems for team onboarding

React, D3.js, Node.js, PostgreSQL

About

ML & AI

Data Engineering

Languages

Finance Tools

Experience

Education

Projects

Robotics ML Research — UT Austin BWI Project

Equity Prediction Models

Research Article Processor

Research & Foundations

Research philosophy

Reading status taxonomy

Statistical Learning & Inference

Machine Learning & AI

Quantitative Finance

Software Engineering & Systems

Forward-looking research interests

Blog & Technical Commentary

Designing equity prediction models that survive data drift

Calibration is not optional in financial classification

Most ML pipelines fail silently — here is how I audit them

Let's connect
