andrecorumba/README.md

Hi there 👋 I'm André Rocha @andrecorumba

🌟 About Me:

🤖 I graduated in Computer Engineering in 2003 and have since been dedicated to applying computational thinking to real-world challenges.

🚀 What I'm Working On:

🇧🇷 I work as a Machine Learning Engineer and Python Developer. Currently, I'm focused on developing Python applications built on Large Language Model (LLM) frameworks. These tools help auditors sift through large volumes of documents and files to pinpoint fraud or irregularities.

📈 Expertise and Interests:

  • Data Analysis & Auditing: Expert in cross-referencing large relational databases and creating audit trails to detect and prevent fraud in public fund management.
  • Programming & Development: Proficient in crafting solutions that integrate advanced technologies, focusing on automation, data analysis, and software development.

🛠 Technologies & Tools

  • Languages: Python
  • Libraries: FastAPI, Pandas, PyTorch, LangChain, HuggingFace, Transformers
  • Databases: Redis, SQL Server
  • DevOps: Linux, Docker, Google Cloud

📚 Projects

  • Inspector - The Inspector is a Proof of Concept (POC) consisting of a web application and scripts, written in Python, that analyze various types of documents. It utilizes GPT-3.5 and GPT-4 language models from OpenAI to provide responses based on questions asked by the user.
  • Leia - LeIA is an application that uses OpenAI's artificial intelligence models for audio and video transcription. You can transcribe new files or consult cases that have already been transcribed.

🏆 Achievements

  • An artificial intelligence solution to analyze documents and assist Brazilian auditors in audit processes, utilizing large language models.
  • An artificial intelligence solution for analyzing audio recordings in support of forensic analysis, streamlining the examination of audio data in forensic investigations.
  • Cross-referencing data to raise alerts on potentially fraudulent transactions.

🌐 Looking Forward:

I am continually seeking to expand my knowledge and skills, diving deeper into the realms of machine learning, data science, and full-stack development. My goal is to create tools and applications that not only solve complex problems but also make a significant impact by improving efficiency and transparency.

Thank you for visiting my profile! Feel free to explore my repositories and reach out if you have any questions or collaboration ideas.

📄 Papers

Paiva, E., Pereira, F., Carvalho, D., Junior, N., Oliveira, R., Bonifácio, S., Rocha, A., Oliveira, H., Cezar, F., & Junior, H. (2024). Continued pre-training of LLMs for Portuguese and Government domain: A proposal for product identification in textual purchase descriptions. Proceedings of the Association for the Advancement of Artificial Intelligence (AAAI). https://openreview.net/pdf?id=HBDb1ybEcs

Rocha, A. L. M., Rezende, M. S., & Oliveira, T. C. (2022). Alice: Desafios, resultados e perspectivas da ferramenta de auditoria contínua de compras públicas governamentais com uso de inteligência artificial. Revista da CGU, 14(26), 296-308. https://doi.org/10.36428/revistadacgu.v14i26.530

💬 Let's Connect
