Josue Ortega Caro

Machine Learning for Neural Dynamics

Wu Tsai Fellow · Yale University · Cardin Lab

Computational Neuroscience · Foundation Models · Generative Models · Inductive Bias

I develop computational models and machine learning methods to understand how neural circuits give rise to perception, learning, and decision-making. My work spans foundation models for brain activity, neural integral equation architectures, and analyses of how neuromodulatory signals change during learning.

Foundation Models for Neuroscience

Self-supervised transformers for fMRI and neural recordings at scale

Neural Integral Equations

Learning integral operators from spatiotemporal biological data

Robustness & Inductive Bias

How architectural choices shape generalization in biological and artificial networks

Foundation Models for Brain Activity

BrainLM is a foundation model trained on 6,700 hours of fMRI recordings. It predicts clinical variables, forecasts future brain states, and discovers functional networks through zero-shot inference.

ICLR 2024

Neural Integral Equations

Attentional Neural Integral Equations (ANIE) learn unknown integral operators from data, providing a principled framework for modeling spatiotemporal dynamics in physical and biological systems.

Nature Machine Intelligence 2024
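
For readers unfamiliar with the term, an integral equation of the second kind has the form y(t) = f(t) + ∫ K(t, s) y(s) ds, where the kernel K defines the integral operator. ANIE learns that operator from data; the sketch below does something much simpler and is not the ANIE method. It assumes the kernel is already known, discretizes the integral with the trapezoidal rule, and solves for y by fixed-point iteration. The function name `solve_fredholm` and the example kernel are hypothetical, chosen only for illustration.

```python
import numpy as np

def solve_fredholm(f, kernel, a=0.0, b=1.0, n=201, iters=200, tol=1e-10):
    """Solve y(t) = f(t) + int_a^b K(t, s) y(s) ds by fixed-point iteration.

    Assumes the integral operator is a contraction so the iteration converges.
    """
    t = np.linspace(a, b, n)
    # Trapezoidal quadrature weights on the uniform grid.
    w = np.full(n, (b - a) / (n - 1))
    w[0] *= 0.5
    w[-1] *= 0.5
    K = kernel(t[:, None], t[None, :])  # K[i, j] = K(t_i, s_j)
    y = f(t).copy()
    for _ in range(iters):
        y_next = f(t) + K @ (w * y)      # apply the discretized operator
        converged = np.max(np.abs(y_next - y)) < tol
        y = y_next
        if converged:
            break
    return t, y

# Illustrative problem (an assumption, not from the source):
# K(t, s) = 0.5 * t * s and f(t) = t, whose exact solution is y(t) = 1.2 * t.
t, y = solve_fredholm(lambda t: t, lambda t, s: 0.5 * t * s)
max_error = np.max(np.abs(y - 1.2 * t))
```

In a learned setting the kernel would be replaced by a trainable model and the solver would sit inside the training loop, but the structure of "apply operator, add forcing term, repeat" is the same.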

PRISMT: Cortical Cholinergic Signaling & Learning

A structured multimodal transformer combining masked autoencoding with causal attention to reveal selective changes in cholinergic signaling during visual perceptual learning.

bioRxiv 2025

FLUX: Longitudinal Flow Matching

Geometry-aware flow matching for unpaired longitudinal snapshots with mixture-of-experts velocity decomposition for unsupervised regime discovery, evaluated on Lorenz dynamics, widefield cortical learning, and embryoid body differentiation.

arXiv 2026
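
As a rough illustration of the ingredients named above, the following sketch computes a standard flow matching loss for a velocity field written as a gated mixture of experts, v(x, t) = Σ_k g_k(x, t) v_k(x, t). Everything here is an assumption made for illustration: the experts are plain linear maps, snapshots are paired by index rather than by any geometry-aware coupling, and the names `moe_velocity` and `flow_matching_loss` are hypothetical. It is not the FLUX implementation.

```python
import numpy as np

def softmax(z, axis=-1):
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def moe_velocity(x, t, expert_W, expert_b, gate_W):
    """Mixture-of-experts velocity: gate-weighted sum of linear experts."""
    h = np.concatenate([x, t[:, None]], axis=1)                      # (n, d+1)
    gates = softmax(h @ gate_W)                                      # (n, K)
    expert_out = np.einsum("ni,kio->nko", h, expert_W) + expert_b    # (n, K, d)
    return np.einsum("nk,nko->no", gates, expert_out), gates

def flow_matching_loss(x0, x1, params, rng):
    """Regress v(x_t, t) onto x1 - x0 along straight-line interpolants."""
    t = rng.uniform(size=len(x0))
    x_t = (1 - t)[:, None] * x0 + t[:, None] * x1   # point on the path
    target = x1 - x0                                # its constant velocity
    v, _ = moe_velocity(x_t, t, *params)
    return np.mean(np.sum((v - target) ** 2, axis=1))

# Toy check (assumed data): the second snapshot is the first shifted by a
# constant, so experts whose bias equals that shift give zero loss.
rng = np.random.default_rng(0)
n, d, K = 256, 2, 3
x0 = rng.normal(size=(n, d))
shift = np.array([1.0, -2.0])
x1 = x0 + shift
params = (np.zeros((K, d + 1, d)), np.tile(shift, (K, 1)), rng.normal(size=(d + 1, K)))
loss = flow_matching_loss(x0, x1, params, rng)
_, gates = moe_velocity(x0, np.zeros(n), *params)
```

The gate values give a soft assignment of each point to an expert, which is one way a mixture decomposition could expose distinct dynamical regimes without labels.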

Robustness & Generalization

Robust deep learning models rely on low-frequency information. Local convolutions induce an implicit bias toward high-frequency adversarial examples via the Fourier Uncertainty Principle.

PLOS Comp. Bio. 2023 · Frontiers 2024
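
To make "low-frequency information" concrete, here is a minimal sketch of isolating the low spatial frequencies of an image with a radial mask in the Fourier domain. It is an illustration only, not the analysis used in the papers; the function name `low_pass`, the cutoff, and the synthetic test image are assumptions.

```python
import numpy as np

def low_pass(image, cutoff):
    """Keep spatial frequencies whose radius is <= cutoff (cycles per image)."""
    h, w = image.shape
    fy = np.fft.fftfreq(h) * h          # integer vertical frequencies
    fx = np.fft.fftfreq(w) * w          # integer horizontal frequencies
    radius = np.sqrt(fy[:, None] ** 2 + fx[None, :] ** 2)
    mask = radius <= cutoff             # symmetric mask, so the output is real
    return np.real(np.fft.ifft2(np.fft.fft2(image) * mask))

# Synthetic image (assumed): a 2-cycle grating plus a 20-cycle grating.
x = np.arange(64)
low = np.tile(np.cos(2 * np.pi * 2 * x / 64), (64, 1))
high = np.tile(np.cos(2 * np.pi * 20 * x / 64), (64, 1))
filtered = low_pass(low + high, cutoff=5)
```

With this cutoff the filtered image contains only the 2-cycle grating. Comparing a model's predictions on original and filtered inputs is one simple way to probe how much it depends on each frequency band.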

bioRxiv 2025

Understanding Brain Computation Through Machine Learning

Key Research Areas

Foundation Models for Brain Activity

Neural Integral Equations

PRISMT: Cortical Cholinergic Signaling & Learning

FLUX: Longitudinal Flow Matching

Robustness & Generalization

Recent Work

Interpretable Neural Dynamics

BrainLM: A foundation model for brain activity recordings

Learning integral operators via neural integral equations

Cell2Sentence: Teaching large language models the language of biology

Recurrent computations for visual pattern completion