Page Not Found
Page not found. Your pixels are in another canvas.
A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.
Page not found. Your pixels are in another canvas.
About me
This is a page not in th emain menu
Published:
This post will show up by default. To disable scheduling of future posts, edit config.yml
and set future: false
.
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Published:
This is a sample blog post. Lorem ipsum I can’t remember the rest of lorem ipsum and don’t have an internet connection right now. Testing testing testing this blog post. Blog posts are cool.
Automatic Speech Recognition in brazilian portuguese. Click here to learn more.
Speech Synthesis in brazilian portuguese. Click here to learn more.
Multi-Lingual Voice Cloning. Click here to learn more.
Speech-lip Synchronizing. Click here to learn more.
KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Click here to learn more.
KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models using YouTube. Click here to learn more.
PTL-AI Furnas Dataset: A Public Dataset for Fault Detection in Power Transmission Lines Using Aerial Images. Click here to learn more.
Published in Proceedings of Sixth International Congress on Information and Communication Technology, 2021
This study identified asymmetric associative patterns in health-related data using the Health Association Rules (HAR) algorithm.
Recommended citation: de Castro Rodrigues, D. et al. (2022). "Discovering Associative Patterns in Healthcare Data". In: Yang, XS., Sherratt, S., Dey, N., Joshi, A. (eds) Proceedings of Sixth International Congress on Information and Communication Technology. Lecture Notes in Networks and Systems, vol 235. Springer, Singapore. https://doi.org/10.1007/978-981-16-2377-6_35 https://link.springer.com/chapter/10.1007/978-981-16-2377-6_35
Published in Brazilian Conference on Intelligent Systems, 2021
In this paper we present an efficient method for training models for speaker recognition using small or under-resourced datasets.
Recommended citation: Edresson Casanova, Arnaldo Candido Junior, Christopher Shulby, Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Hamilton Pereira da Silva, Sandra Maria Aluísio, and Moacir Antonelli Ponti. 2021. "Speech2Phone: A Novel and Efficient Method for Training Speaker Recognition Models". In Intelligent Systems: 10th Brazilian Conference, BRACIS 2021, Virtual Event, November 29 – December 3, 2021, Proceedings, Part II. Springer-Verlag, Berlin, Heidelberg, 572–585. https://doi.org/10.1007/978-3-030-91699-2_39 https://link.springer.com/chapter/10.1007/978-3-030-91699-2_39
Published in Language Resources and Evaluation, 2021
This work consists of creating publicly available resources for Brazilian Portuguese in the form of a novel dataset along with deep learning models for end-to-end speech synthesis.
Recommended citation: Casanova, E., Junior, A.C., Shulby, C. et al. "TTS-Portuguese Corpus: a corpus for speech synthesis in Brazilian Portuguese". Lang Resources & Evaluation 56, 1043–1055 (2022). https://doi.org/10.1007/s10579-021-09570-4 https://link.springer.com/article/10.1007/s10579-021-09570-4
Published in International Conference on Computational Processing of the Portuguese Language., 2021
This work presents the development of an public Automatic Speech Recognition system using only open available audio data, from the fine-tuning of the Wav2vec 2.0 XLSR-53 model pre-trained in many languages over Brazilian Portuguese data.
Recommended citation: Stefanel Gris, L. R., Casanova, E., Oliveira, F. S. D., Silva Soares, A. D., & Candido Junior, A. (2022, March). "Brazilian Portuguese Speech Recognition Using Wav2vec 2.0". In International Conference on Computational Processing of the Portuguese Language (pp. 333-343). Springer, Cham. https://link.springer.com/chapter/10.1007/978-3-030-98305-5_31
Published in Interspeech, 2021
In this paper, we propose SC-GlowTTS: an efficient zero-shot multi-speaker text-to-speech model that improves similarity for speakers unseen during training.
Recommended citation: Casanova, E., Shulby, C.D., Gölge, E., Müller, N.M., Oliveira, F.S., Júnior, A.C., Soares, A.D., Aluísio, S.M., & Ponti, M.A. (2021). "SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model". "Interspeech". https://www.isca-speech.org/archive/pdfs/interspeech_2021/casanova21b_interspeech.pdf
Published in ArXiv, 2021
This paper presents CORAA (Corpus of Annotated Audios) v1. with 290.77 hours, a publicly available dataset for ASR in Brazilian Portuguese containing validated pairs (audio-transcription).
Recommended citation: Candido Junior, A., Casanova, E., Soares, A. et al. "CORAA ASR: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese." Lang Resources & Evaluation (2022). https://doi.org/10.1007/s10579-022-09621-4 https://link.springer.com/article/10.1007/s10579-022-09621-4
Published in SIBGRAPI 2022, 2022
We present a new images dataset called PTL-AI Furnas Dataset as a new benchmark for fault detection in power transmission lines.
Recommended citation: Oliveira et al. "PTL-AI Furnas Dataset: A Public Dataset for Fault Detection in Power Transmission Lines Using Aerial Images." SIBGRAPI 2022 (2022). http://sibgrapi.sid.inpe.br/col/sid.inpe.br/sibgrapi/2022/09.22.22.53/doc/oliveira-33_inpe.pdf
Undergraduate course, UFMT, Engineering Department, 2022
Undergraduate course, UFMT, Engineering Department, 2022