Nakato Foto
verified

Nakato

Research Scientist

Focused on Multimodal Representation Learning and Large Language Models. Bridging the gap between vision and language through scalable architectural innovations and RLHF.

About Me

I am a dedicated researcher with a passion for understanding how machines can learn more cohesive representations of our world. My work currently explores the intersection of visual perception and linguistic reasoning.

  • check_circle Multimodal Alignment & Contrastive Learning
  • check_circle Efficient Attention Mechanisms in Transformers
  • check_circle Human-in-the-loop Reinforcement Learning (RLHF)
  • check_circle Representation Alignment across Domains
hub
visibility Vision
translate Language

Featured Research Projects

Imitator

A multimodal framework for gesture-to-speech translation using transformer architectures. Focused on zero-shot generalization across diverse speaker profiles.

PYTORCH LLAMA TRANSFORMERS SIGN LANGUAGE
View Project arrow_forward

ActivAdda

Implemented activation addition (steering vectors) to bias LLM responses without parameter updates.

PYTHON LLM EMBEDDINGS INTERPRETABILITY
View Project arrow_forward

SKAI - Affective Response Generation

Trained a reward model over fine-grained emotional categories and adapted GPT-2 to generate supportive and sentiment-aware responses.

PPO RLHF PYTORCH
View Project arrow_forward

Experiencia Laboral

Superintendencia de Banca, Seguros y AFP

Superintendencia de Banca, Seguros y AFP

presencial

AI Developer

Automation of end-to-end testing with Cypress, managing data pipelines and Jira integration for large-scale deployments.

2025 — Present

Educación

Pontificia Universidad Católica del Perú

Pontificia Universidad Católica del Perú

Artificial Intelligence (AI) Summer Camp

August — 2025
Universidad Peruana de Ciencias Aplicadas

Universidad Peruana de Ciencias Aplicadas

B.S. Computer Science

Specialization in Artificial Intelligence and Neuroscience

2022 — Present

Selected Publications

Learning Motion-Based Embeddings for Sign Language Retrieval.

Manuscript in preparation (Draft) — 2026

picture_as_pdf

Imitator: Multimodal Sign Language Model

SimBig25 International Conference on Information Management and Big Data — 2025

picture_as_pdf

Let's build the future.

Interested in collaborating on Data/ML research? I'm always open to discussing new opportunities or academic partnerships.