ML Research Engineer

Audio AI · Embedded ML · MLOps

I build audio machine learning systems that run outside the notebook. My work focuses on models for sound event detection, voice activity detection, and privacy-preserving audio, with particular attention to deployment on resource-constrained hardware.

For three years I was a Research Engineer at the University of Surrey (CVSSP), working on the EPSRC-funded AI for Sound project under Prof. Mark Plumbley. I published at ICASSP, WASPAA, ICWE, CHiME, Inter-Noise and SMC, built open-source tools, and supervised student projects. Before Surrey I spent four years at Ikatu developing embedded audio systems for Bang & Olufsen home automation products, which shaped how I think about the gap between research prototypes and devices that actually ship.

I work primarily in Python and PyTorch, and I am equally comfortable with microcontrollers, soldering irons, and signal processing from first principles. My background combines electrical engineering (Universidad de la República, Uruguay) and an MSc in Sound and Music Computing (Universitat Pompeu Fabra, Barcelona, 2021).

Currently based in Montevideo, Uruguay. Italian citizen with EU work authorization. Open to remote roles in LATAM/Europe and relocation within the EU.

Beyond my research work, I am an electronic music DJ and producer with over a decade of practice and formal musical training. Most of my interest in how machines listen started from how I listen myself.

Email: gabobibbo@gmail.com

Links: [ORCID] | [Scholar] | [Github] | [LinkedIn]

Publications & Works
Speech Removal Framework
Privacy for Audio AI: Risks, Challenges, and Emerging Solutions in the Era of Audio AI [Panel discussion]
Thomas Deacon; Jennifer Williams; Jason R. C. Nurse; Christopher Hicks; Gabriel Bibbó; Arshdeep Singh and Mark D. Plumbley
2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio
Identifier: 991036566602346 | AES program
Speech Removal Framework
Speech Removal Framework for Privacy-preserving Audio Recordings
Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley
2025 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Tahoe City, CA, October 2025.
DOI: 10.5281/zenodo.17050321 | HF online demo
Room Acoustics and Microphones
Room Acoustics and Microphone Characteristics Show Systematic Impact on Sound Event Recognition
Gabriel Bibbó; Craig Cieciura; Mark D. Plumbley
Proceedings of the 54th International Congress and Exposition on Noise Control Engineering (Inter-Noise 2025), São Paulo, Brazil, August 2025.
ISBN: 978-65-272-1573-8
Integrating IP Broadcasting with Audio Tags
Integrating IP broadcasting with audio tags: Workflow and challenges
Rhys Burchett-Vass; Arshdeep Singh; Gabriel Bibbó; Mark D. Plumbley
2025 AES International Conference on Artificial Intelligence and Machine Learning for Audio
open research | preprint
Soundscape Experience Mapping
Soundscape Experience Mapping: A Deep Listening Approach for Eliciting Older Adults' Perceptions of Indoor Soundscapes
Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley
Forum Acusticum / Euronoise 2025 (11th Convention of the European Acoustics Association), Málaga, Spain, June 2025.
link
Personalized Live Sound Recognition PANNs
Personalized Live Sound Recognition Using Efficient PANNs [Show and Tell]
Arshdeep Singh; Gabriel Bibbó; Thomas Deacon; Haohe Liu; Mark D. Plumbley
IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, April 2025.
link
Environmental Sound Classification Embedded
Environmental sound classification on an embedded hardware platform
Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley
INTER-NOISE and NOISE-CON Congress and Conference Proceedings, Nantes, France, August 2024.
DOI: 10.3397/in_2024_3723
Sounds of Home Dataset
The Sounds of Home: A Speech-Removed Residential Audio Dataset for Sound Event Detection
Gabriel Bibbó; Thomas Deacon; Arshdeep Singh; Mark D. Plumbley
8th International Workshop on Speech Processing in Everyday Environments (CHiME 2024), Kos Island, Greece, September 2024.
DOI: 10.21437/chime.2024-11
Soundscape Personalisation at Work
Soundscape Personalisation at Work: Designing AI-Enabled Sound Technologies for the Workplace
Thomas Deacon; Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley
International Conference on Sound and Music Computing (SMC 2024), Porto, Portugal, July 2024.
paper
Raspberry Pi Sound Event Recognition Demo
Recognise and Notify Sound Events Using a Raspberry PI Based Standalone Device [Demo]
Gabriel Bibbó; Arshdeep Singh; Mark D. Plumbley
IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA 2023), New York, U.S.A, October 2023.
DOI: 10.5281/zenodo.15465882 | video/demo
Harmonic EDM Mixing Compatibility
A New Compatibility Measure for Harmonic EDM Mixing
Gabriel Bibbó Frau; Ángel Faraldo
International Conference on Web Engineering (ICWE 2022), Springer, Bari, Italy, July 2022.
DOI: 10.1007/978-3-031-09917-5_37
Thesis Harmonic EDM Mixing
Towards a New Compatibility Measure for Harmonic EDM Mixing
Gabriel Bibbó; Angel Faraldo
Dissertation or Thesis, Universitat Pompeu Fabra, October 2021.
DOI: 10.5281/zenodo.5554688
Thesis Mobile Robots SDR
Autonomous Mobile Robots Comunicated by Software Defined Radio
Gabriel Bibbó; Mariana Gelós; Martín Randall; Pablo Belzarena; Federico Larroca
Dissertation or Thesis, Universidad de la República (Uruguay), December 2017.
link
Projects
3HATO Project
3H - ATO (Third Hand - Avoid Touching Objects)
Creator (Feb 2020 - Aug 2022), Associated with Universidad de la República.
Mechanical device to avoid contact with contaminated surfaces (bus handrails, doors, buttons). Features an optimized shape, lightweight, and sanitizable.
Promotional Video
IoT Soap Dispenser Project
Automatic IoT soap dispenser
Designer and Developer (Apr 2020 - Feb 2021)
IoT device for hand washing in the meat industry. Stainless steel, WiFi, cloud platform, IR/RFID sensors, and 3-litre capacity.
UyVoy App Project
UyVoy Mobile App
Project Manager (Mar 2020 - Aug 2020)
Blockchain-based mobile app for booking appointments to avoid crowds during the pandemic. Supported by Aeternity, emerged from HackCovid19 (ORT Uruguay).
Experience & Activities

Visiting Researcher (collaboration), University of Surrey, Guildford, UK (Dec 2025 - present)
Continuing research collaboration with Mark D. Plumbley and Simone Spagnol on audio-language models for voice activity detection and privacy-preserving audio. The work includes LoRA fine-tuning, prompt optimization of Qwen-Audio family models, and evaluation under acoustic degradations. It builds on research started during my role at the University of Surrey.

Research Engineer in Sound Sensing, University of Surrey, Guildford, UK (Nov 2022 - Nov 2025)
Developed AI-driven sound sensing systems for the AI for Sound project (ai4s.surrey.ac.uk), including software, libraries, datasets, and pilot POCs for real-world smart environments. Designed, deployed, and evaluated research prototypes through iterative cycles, combining deep learning, audio signal processing, user feedback, and co-design principles. Built privacy-preserving audio resources and evaluation pipelines for sound event detection, voice activity detection, embedded sound recognition, and acoustic robustness studies. Published and presented peer-reviewed research outcomes at international conferences, promoted project activities through CVSSP and AI for Sound, and supervised bachelor's and master's degree final projects.

Technical Support Engineer - Google Workspace, Webhelp, Barcelona, Spain (Mar 2022 - Nov 2022)
Tier 3 technical support in cloud services for Google Workspace enterprise customers.

IT Auditor, KPMG, Barcelona, Spain (Nov 2021 - Mar 2022)
Support to telecommunications companies or IT departments in audit services.

R&D Engineer, Ikatu, Montevideo, Uruguay (Aug 2016 - Dic 2019)
Designed and shipped embedded and IoT systems for Bang & Olufsen home automation products: low-level drivers, hardware integration, and Internet connectivity. Owned product lifecycle work across requirements, architecture, implementation, testing, validation, and customer-facing documentation. Trained and onboarded incoming programmers on embedded development practices.

Intern, Ikatu, Montevideo, Uruguay (Apr 2016 - Jul 2016)
Developed and coordinated a complete home automation system project.

Affiliate Member, IEEE Signal Processing Society (Member #101096528) (Jan 2025 - Dec 2025)

Grant: AI for Sound, Engineering and Physical Sciences Research Council (EPSRC) (Apr 2020 - Dec 2025)
Part of the team working on the EP/T019751/1 grant to bring "AI for Sound" technology out of the lab.
Education

Master's Degree in Sound and Music Computing, Universitat Pompeu Fabra, Barcelona (2021).

Bachelor's Degree in Electrical Engineering (spec. Signal Processing), Universidad de la República, Uruguay (2017).

Music school "Virgilio Scarabelli Alberti", Montevideo (Musical language, guitar, ensembles) (2005).
Skills

Certifications: PRINCE2® Foundation in Project Management, Deep Learning Specialization (Coursera), Machine Learning (Stanford/Coursera), Audio Signal Processing for Music Applications (Coursera), Electronic Music Production (AURA).

Technical Skills:
AI/ML: Deep Learning (PyTorch), Audio Language Models, CNNs, Transformers, Model Compression, Supervised & Unsupervised Learning, Fine-tuning, Prompt Optimization Frameworks, Privacy Preserving Machine Listening, Sound Event Detection, Voice Activity Detection, Edge AI/TinyML.
Audio: Music Information Retrieval, Digital Signal Processing, Time-frequency Processing, Digital Filters, Audio Features, Synthesis, Psychoacoustics, Real-Time Systems.
Programming: Python (NumPy, SciPy, TorchAudio), C/C++, MATLAB, Arduino, Git, Linux CLI, SDR Programming, HTML.
Hardware: Embedded & Microprocessor Systems, RTOS, Analog & Digital Electronics, Wireless Communications, Control Theory.
Research: User-Centred Design, Experiment Design, Ethics, GDPR, Technical Communication, Collaboration, PRINCE2 Methodologies, Open-Source Development, FAIR Principles, Proposal Writing.

Languages: Spanish (Native), English (C1), Portuguese (A2)