AgentIQX-Multiagentic-Gradio-App

AgentIQX πŸš€

Modern Multi-Agent Audio/Video/PDF Transcription & Summarization App

AgentIQX Banner

Python
Gradio
Whisper
FAISS
TinyLlama
PyPDF2
gTTS
MIT License


πŸ’» Tech Stack

| Layer | Technology | |β€”β€”-|———–| | Frontend/UI | Gradio Blocks, Custom CSS (Glassmorphic, Dark/Cyan theme) | | Transcription | OpenAI Whisper (local GPU/CPU) | | Summarization / LLM | Ollama (TinyLlama, DeepSeek, Phi3) | | Retrieval / Search | SentenceTransformers (all-MiniLM-L6-v2), FAISS | | PDF Handling | PyPDF2, PyMuPDF | | Text-to-Speech | gTTS (local), pyttsx3 | | Email Automation | SMTP, dotenv credentials | | Utilities | Python 3.11+, Logging, Chunking, FAISS Index Management |


APP_SNAPSHOTS

Screenshot 1 Screenshot 2 Screenshot 3 Screenshot 4 Screenshot 5 Screenshot 6 Screenshot 7 Screenshot 8

✨ Key Features

1. Transcribe Anything

2. Flexible Summarization

3. Retrieval-Augmented QA (RAG)

4. Step-by-Step Explanations

5. Semantic Embedding & Chunk Management

6. Text-to-Speech (TTS)

7. Email Automation

8. Glassmorphic, Responsive UI

9. Robust Logging & Error Handling


πŸ“Š Workflow Overview

  1. Upload & Extract β†’ PDF / Audio / Video β†’ Raw text via Whisper & multi-method extraction.
  2. Chunk & Embed β†’ Text split into chunks, embedded (SentenceTransformers), indexed (FAISS).
  3. Summarize β†’ Select style, level of detail, focus areas β†’ Local LLM returns summary.
  4. TTS & Email β†’ Speak summary using gTTS β†’ Email via SMTP with secure validation.
  5. RAG Q&A β†’ Ask questions β†’ Relevant chunks retrieved β†’ Answer generated.
  6. Explain Agent β†’ Generate step-by-step rationale for any answer/context.

⚑ Advanced Features


πŸš€ Getting Started

```bash

Create a fresh virtual environment and install dependencies

pip install -r requirements.txt

Set up .env for email credentials

echo β€œSENDER_EMAIL=your_email@domain.com\nSENDER_PASSWORD=yourpassword” > .env

Launch AgentIQX

python app.py