Publications

Journal

2025

R. Silva, H. Amamou, L. Ferraz, F. Silva, A. Avila,

R. Silva, H. Amamou, L. Ferraz, F. Silva, A. Avila, “Fake News Detection in Portuguese Under Large Language Model-Generated Content,” Journal of the Brazilian Computer Society, accepté.

J.V. Souza, H. Amamou, R. Chen, E. Salari, R. Gulbemann, C. Niklaus, T. Serpa, M. Lima, P. Pinto, S. Kshirsagar, A. Davoust, S. Hardschuh, A. Avila,

Cross-Lingual Keyword Extraction for Pesticide Terminology in Brazilian Portuguese and English,” Journal of the Brazilian Computer Society, accepté.

2024-2023

O. Mengara, A. Avila, and T. Falk,

Backdoor Attacks to Deep Neural Networks: A Survey of the Literature, Challenges, and Future Research Directions, IEEE Access, Vol. 12, Jan. 2024.

A. Pimentel, H. Guimarães, A. Avila, T. Falk,

Environment-Aware Knowledge Distillation for Improved Resource-Constrained Edge Speech Recognition, Applied Sciences, Vol.13, No.23, Nov. 2023.

2022

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement, NeurIPS ENLSP Workshop, 2022.

A. Pimentel, H. Guimarães, A. Avila, M. Rezagholizadeh, T. Falk,

How Robust is Robust wav2vec 2.0 for Edge Applications?: An Exploration into the Effects of Quantization and Model Pruning on “In-the-Wild” Speech Recognition, Edge Intelligence Workshop, 2022. 

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for ”In the Wild” Edge Applications, Edge Intelligence Workshop, 2022.

2021

A. Avila, D. O’Shaughnessy, T. Falk,

Automatic Speaker Verification from Affective Speech Using Gaussian Mixture Model Based Estimation of Neutral Speech Characteristics, Speech Communications, Volume 132, September 2021, Pages 21-31.

A. Avila, J. Alam, F. Prado, D. O’Shaughnessy, T. Falk,

On the Use of Blind Channel Response Estimation and a Residual Neural Network to Detect Physical Access Attacks to Speaker Verification Systems, Computer Speech & Language, Volume 66, March 2021.

2020 -2019

A. Avila, D. O’Shaughnessy, T. Falk,

Non-intrusive Speech Quality Prediction Based on the Blind Estimation of Clean Speech and the i-vector Framework, Quality and User Experience, Vol. 5, No. 11, Nov. 2020.

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

On the Use of the I-vector Speech Representation for Instrumental Quality Measurement, Quality and User Experience, Vol. 5, No. 6, June 2020.

B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar,

Free-Reference Image Quality Assessment Framework using Metrics Fusion and Dimensionality Reduction, Signal & Image Processing, Vol. 10, No. 5, 14 pages, Oct. 2019.

2018

A. Avila, Z. Akhtar, J. Santos, D. O’Shaughnessy, T. Falk,

Feature Pooling for Spontaneous Speech Based Emotion Recognition In-the-Wild, IEEE Trans. Affective Computing, Vol. 12, No.1, 2021, pp. 177-188.

Conferences

2026

A. Fursule, S. Kshirsagar, and A. Avila.

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis. IEEE Conference on Artificial Intelligence, CAI 2026.

Y. Sadfi, H. Amamou, A. Davoust, A. Avila.

Comparative Analysis of Machine Learning, LLMs, and RAG for Fake News Detection. IEEE Conference on Artificial Intelligence, CAI 2026.

A. Anacin, S. Kshirsagar, A. Avila.

Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments. IEEE Conference on Artificial Intelligence, CAI 2026.

L. Ferraz, J. Leal, A. Avila, T. Pardo, F. Batista, R. Silva.

Retrieval-Augmented Generation with Small Language Models for Fake News Detection. International Conference on Computational Processing of Portuguese, PROPOR 2026.

M. Doghmane, H. Amamou, S. Gagnon, A. Davoust, and A. Avila.

Infox-QC: A Quebec-Focused French Corpus for Misinformation Detection and AI Robustness Assessment. Language Resources and Evaluation Conference, LREC 2026.

V. Nallaguntla, A. Fursule, S. Kshirsagar, and A. Avila.

PhonemeDF: A Synthetic Speech Dataset for Audio Deepfake Detection and Naturalness Evaluation. Language Resources and Evaluation Conference, LREC 2026.

2025

N. Soltani, S. Shamsi, Z. Abou El Houda, R. Khoury, K.Costa, T. Falk, and A. Avila,

Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks, IEEE SMC 2025

H. Amamou, S. Gagnon, A. Davoust, and A. Avila,

Towards Robust Retrieval-Augmented Generation Based on Knowledge Graph: A Comparative Analysis, IEEE SMC 2025

D. Temmar, A. Hamadene, V. Nallaguntla, A, Fursule, M. Allili, S. Kshirsagar, and A. Avila,

Phonetic Analysis of Real and Synthetic Speech Using HuBERT Embeddings: Perspectives for Deepfake Detection, IEEE SMC 2025

2025

N. Soltani, S. Shamsi, Z. Abou El Houda, R. Khoury, K.Costa, T. Falk, and A. Avila,

Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks, IEEE SMC 2025

H. Amamou, S. Gagnon, A. Davoust, and A. Avila,

Towards Robust Retrieval-Augmented Generation Based on Knowledge Graph: A Comparative Analysis, IEEE SMC 2025

D. Temmar, A. Hamadene, V. Nallaguntla, A, Fursule, M. Allili, S. Kshirsagar, and A. Avila,

Phonetic Analysis of Real and Synthetic Speech Using HuBERT Embeddings: Perspectives for Deepfake Detection, IEEE SMC 2025

2024

H. Guimarães, A. Pimentel, A. Avila, and T. Falk,

VIC-KD: Variance-Invariance-Covariance Knowledge Distillation to Make Keyword Spotting More Robust Against Adversarial Attacks, ICASSP 2024

M. Simão, F. Prado, O. Wahab, and A. Avila,

2024, December. Tempcharbert: Keystroke dynamics for continuous access control based on pre-trained language models. In 2024 IEEE International Workshop on Information Forensics and Security (WIFS) (pp. 1-6). IEEE.

2023

H. Guimarães, Y. Zhu, O. Mengara, A. Avila, and T. Falk,

Assessing the Vulnerability of Self-Supervised Speech Representations for Keyword Spotting under White-Box Adversarial Attacks, IEEE SMC 2023.

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, B. Chen, and T. Falk,

RobustDistiller: Compressing Universal Speech Representations for Enhanced Environment Robustness, ICASSP 2023.

O. A. Wahab and A. Avila,

(2023, December). A Max-Min Security Game for Coordinated Backdoor Attacks on Federated Learning. In 2023 IEEE International Conference on Big Data (BigData) (pp. 3566-3573). IEEE.

A. Pimentel, H. R. Guimarães, A. Avila, and T. H. Falk,

(2023). Environment-aware knowledge distillation for improved resource-constrained edge speech recognition. Applied Sciences, 13(23), 12571.

R. Khoury, A. R. Avila, J. Brunelle and B. M. Camara.

How Secure is Code Generated by ChatGPT?”, IEEE Systems, Man, and Cybernetics (SMC), Maui, HA, USA, Oct. 2023.

2022

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

Improving the Robustness of DistilHuBERT to Unseen Noisy Conditions via Data Augmentation, Curriculum Learning, and Multi-Task Enhancement, NeurIPS ENLSP Workshop, 2022.

A. Pimentel, H. Guimarães, A. Avila, M. Rezagholizadeh, T. Falk,

How Robust is Robust wav2vec 2.0 for Edge Applications?: An Exploration into the Effects of Quantization and Model Pruning on “In-the-Wild” Speech Recognition, Edge Intelligence Workshop, 2022. 

H. Guimarães, A. Pimentel, A. Avila, M. Rezagholizadeh, T. Falk,

An Exploration into the Performance of Unsupervised Cross-Task Speech Representations for ”In the Wild” Edge Applications, Edge Intelligence Workshop, 2022.

A.Avila, etal.

Low-bit Shift Network for End-to-End Spoken Language Understanding, Interspeech 2022, DOI: 10.21437/Interspeech.2022-760.

2021

Cao, Y., Potdar, N., Avila, A.R.

(2021) Sequential End-to-End Intent and Slot Label Classification and Localization. Proc. Interspeech 2021, 1229-1233, DOI: 10.21437/ Interspeech.2021-1569

N.Potdar, A.Avila,C.XING, et al.

A Streaming End-to-End Framework For Spoken Language Understanding, IJCAI 2021, pp. 3906-3914, DOI: 10.24963/ijcai.2021/538.

2019

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Blind Channel Response Estimation for Replay Attack Detection, Interspeech 2019, pp. 2893-2897, DOI: 10.21437/Interspeech.2019-2956.

A. Avila, H. Gamper, C. Reddy, R. Cutler, I. Tashev, J. Gehrke,

Non-intrusive speech quality assessment using neural networks, ICASSP 2019 – 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 631-635, DOI: 10.1109/ICASSP.2019.8683175.

A. Avila, S. Kshirsagar, A. Tiwari, D. Lafond, D. O’Shaughnessy, and T. Falk,

Speech-Based Stress and Emotion Classification Based on Modulation Spectral Features and Convolutional Neural Networks, 27th European Signal Processing Conference (EUSIPCO) 2019, pp. 1-5, DOI: 10.23919/EUSIPCO.2019.8903014.

B. Sadou, A. Lahoulou, T. Bouden, A. Avila, T. Falk, Z. Akhtar,

Blind Image Quality Assessment using SVD based Dominant Eigenvectors for Feature Selection, SIPRO 2019.

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Intrusive Quality Measurement of Noisy and Enhanced Speech based on i-Vector Similarity, QoMEX 2019, pp. 1-5, DOI: 10.1109/QoMEX.2019. 8743285.

Nominated for Best Paper Award

2018-2016

A. Avila, J. Alam, D. O’Shaughnessy, T. Falk,

Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition, Interspeech 2018, pp. 3663-3667, DOI: 10.21437/Interspeech.2018-2350..

A. Avila, J. Monteiro, D. O’Shaughnessy, T. Falk,

Speech Emotion Recognition on Mobile Devices Using a New Modulation Spectrum Pooling and Deep Neural Networks, ISSPIT 2017, 2017 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), pp. 360-365, DOI: 10.1109/ISSPIT.2017.8388669.

A. Avila, B. Cauchi, S. Goetze, S. Doclo, T. Falk,

Performance Comparison of Intrusive and Non-instrusive Instrumental Quality Measures for Enhanced Speech, IWAENC 2016, 2016 IEEE International Workshop on Acoustic Signal Enhancement (IWAENC), pp. 1-5, DOI: 10.1109/IWAENC.2016.7602907.

2014

A. Avila, M. Santos, F. Fraga, and T. Falk,

The Effect of Speech Rate on Automatic Speaker Verification: a Comparative Analysis of GMM-UBM and I-vector Based Methods, 12th Audio Engineering Conference (AES-Brazil), May 2014.

A. Avila, M. Paja, F. Fraga, D. O’Shaughnessy, and T. Falk,

Improving the Performance of Far-Field Speaker Verification Using Multi-Condition Training: The Case of GMM-UBM and i-vector Systems, Interspeech’2014.

A. Avila, M. Santos, F. Fraga, and T. Falk,

Investigating the use of Modulation Spectral Features within an Ivector Framework for Far-Field Automatic Speaker Verification, International. Telecommunications Symposium, 2014.

2013-2011

A. Avila, F. Prado, G. Kobayashi, E. Rocha,

Performance Comparison of Overdetermined Multilateration Algorithms for Estimating Aircraft Position. In: Workshop on Distance Geometry and Applications (DGA), 2013, Manaus.

A. Avila, M. Paja, F. Fraga,

Proposta de um Sistema de Diálogo Automático Baseado em Algoritmos de Aprendizado Por Reforço. In: Proceedings of the 10th AES Brazil Conference. Rio de Janeiro: Audio Engineering Society, 2012. v. 1. p. 75-78.

A. Avila, M. Paja, F. Fraga,

Integracão de Sistemas de Reconhecimento, Tradução e Síntese Automática da Fala para Facilitar a Comunicação de Turistas. In: The 14th LAC AES Conference. Montevideo: Audio Engineering Society, 2011. v. 1, p. 1-4.

Contact

Please enable JavaScript in your browser to complete this form.
Scroll to Top