Nakato Foto

verified

Nakato

Research Scientist

Focused on Multimodal Representation Learning and Large Language Models. Bridging the gap between vision and language through scalable architectural innovations and RLHF.

link LinkedIn code GitHub alternate_email Email

About Me

I am a dedicated researcher with a passion for understanding how machines can learn more cohesive representations of our world. My work currently explores the intersection of visual perception and linguistic reasoning.

check_circle Multimodal Alignment & Contrastive Learning
check_circle Efficient Attention Mechanisms in Transformers
check_circle Human-in-the-loop Reinforcement Learning (RLHF)
check_circle Representation Alignment across Domains

hub

visibility Vision

translate Language

Featured Research Projects

Imitator

A multimodal framework for gesture-to-speech translation using transformer architectures. Focused on zero-shot generalization across diverse speaker profiles.

PYTORCH LLAMA TRANSFORMERS SIGN LANGUAGE

View Project arrow_forward

ActivAdda

Implemented activation addition (steering vectors) to bias LLM responses without parameter updates.

PYTHON LLM EMBEDDINGS INTERPRETABILITY

View Project arrow_forward

SKAI - Affective Response Generation

Trained a reward model over fine-grained emotional categories and adapted GPT-2 to generate supportive and sentiment-aware responses.

PPO RLHF PYTORCH

View Project arrow_forward

Experiencia Laboral

Superintendencia de Banca, Seguros y AFP

presencial

AI Developer

Automation of end-to-end testing with Cypress, managing data pipelines and Jira integration for large-scale deployments.

2025 — Present

Educación

Pontificia Universidad Católica del Perú

Artificial Intelligence (AI) Summer Camp

August — 2025

Universidad Peruana de Ciencias Aplicadas

Universidad Peruana de Ciencias Aplicadas

B.S. Computer Science

Specialization in Artificial Intelligence and Neuroscience

2022 — Present

Selected Publications

Learning Motion-Based Embeddings for Sign Language Retrieval.

Manuscript in preparation (Draft) — 2026

picture_as_pdf

Imitator: Multimodal Sign Language Model

SimBig25 International Conference on Information Management and Big Data — 2025

picture_as_pdf

Let's build the future.

Interested in collaborating on Data/ML research? I'm always open to discussing new opportunities or academic partnerships.

Say Hello View GitHub