Chibuzor Okocha

I am a PhD student in Engineering with a minor in Computer Science at the University of Florida, where I am privileged to be a member of the UF DataStudio Lab. Previously, I worked on building speech and audio models for African languages and accents.

My research is focused on Speech and Audio AI, reasoning in Audio Language Models, and developing robust systems for accented and multilingual speech processing. I am passionate about creating inclusive AI systems that work across multiple languages and cultures.

Beyond research, I am passionate about mentoring aspiring AI researchers, building open science communities, and contributing to collaborative initiatives. I firmly believe in democratizing access to knowledge and fostering collaborative ecosystems.

Email CV GitHub Google Scholar

Recent News

🏖️

[2025]

Presenting poster "Can Large Audio Language Models Understand Child Stuttering Speech? Speech Summarization, and Source Separation" at ASRU 2025 Satellite Workshop in Hawaii.

📝

[Jan 25]

Submitted three papers to ICASSP 2026 on neural audio codecs and child speech analysis with LALMs.

🎤

[Sept 25]

Excited to present my research at the TTIC Summer Workshop on Foundations of Speech and Audio Foundation Models in Chicago.

🎉

[Jan 25]

Excited to present our AfriSpeech-Dialog work at NAACL 2025.

🔬

[Oct 24]

Presented research on intercultural understanding at FIE 2024 conference.

Research Areas

Speech and Audio AI

Developing advanced AI systems for speech and audio processing applications

Audio Language Models

Researching reasoning capabilities and cognitive processes in audio language models

Accented Speech Recognition

Building robust speech recognition systems for diverse accents and dialects

Multilingual Audio AI

Creating inclusive AI systems that work across multiple languages and cultures

Recent Publications

Afrispeech-Dialog: A Benchmark Dataset for Spontaneous English Conversations in Healthcare and Beyond

NAACL 2025 | Mardhiyah Sanni, Tassallah Abdullahi, Devendra D. Kayande, Emmanuel Ayodele, Naome A. Etori, Michael S. Mollel, Moshood Yekini, Chibuzor Okocha, et al.

A comprehensive dataset for evaluating ASR and summarization on African-accented speech conversations.

AfriVox: Probing Multilingual and Accent Robustness of Speech LLMs

ACL ARR July 2025 (Under Review) | OpenReview

Open-source benchmark across 20 African languages and 100+ African English accents, evaluating multimodal speech LLMs vs traditional ASR/AST models.

Neural Audio Codec Evaluation for Low-Resource African Languages

ICASSP 2026 (Under Review) | Chibuzor Okocha, et al.

Comprehensive evaluation framework for neural audio codecs on African speech data and low-resource language settings.

Can large audio language models understand child stuttering speech?

ICASSP 2026 (Under Review) | Chibuzor Okocha, Maya Bakri, Christan Grant | arXiv

Evaluating LALMs on disfluent child speech for source separation and summarization tasks.

Domain-Aware Speaker Diarization On African-Accented English

arXiv preprint (Under Review) | Chibuzor Okocha, Kelechi Ezema, Christan Grant | arXiv

Examining domain effects in speaker diarization for African-accented English across general and clinical dialogues.

View All Publications →

Featured Projects

AfriSpeech-200

Pan-African speech dataset with 100+ accents and 196+ hours of audio for ASR research.

Speech ProcessingASR

CodecEval-Africa

Neural audio codecs evaluation framework for low-resource African language settings.

Neural CodecsLow-Resource

Child Speech Analysis with LALMs

Large Audio Language Models for child interview summarization and speaker separation.

LALMsChild Speech

View All Projects →