New
Flows are already here! 🤖
Create WhatsApp agents with AI in minutes, without writing code.

Automate workflows and connect over 400 apps in a visual, simple, and hassle-free way

Discover FlowsStart for free
Flows
Popular
You are gonna like it 😎
Quickly send messages to a contact list without programming.

Easy, fast and without using complex tools

Discover CampaignsTry for free
Campañas
New
We have a new mobile app! 📱
Manage your campaigns from anywhere.

Fast, easy and always with you.

Download appMore information
App Móvil
logo
  • Product
    Core Features
    • Multi-user chat
      Collaborative team management
    • Analytics
      Detailed metrics and reports
    • Integrated CRM
      Complete contact management
    • Access Permissions
      Granular control per user
    • WhatsApp Business AppNew
      Migrate your number in minutes
    AI and Automation
    • FlowsNew
      No-code automation with AI
    • AI AssistantNew
      ChatGPT integrated in WhatsApp
    • BellsNew
      Smart bulk messages
    • Automatic replies
      24/7 automated support
    Platforms
    • Mobile AppNew
      iOS and Android available
    • REST API
      Complete API integration
    • Integrations
      Connect with 400+ apps
    Use Cases
    • E-commerce
      Sales via WhatsApp
    • Soporte
      Customer support
    • Marketing
      Campaigns and engagement
    Ready to get started?
    Try free for 7 daysCreate free account
  • Resources
    Tools
    • WABA CalculatorNew
      Estimate WhatsApp API costs
    • Link Generator
      Create WhatsApp links
    • Chat Button
      Widget for your website
    • Channel Button
      Promote your channel
    Learning
    • Help Center
      Guides and documentation
    • Tutorials
      Step-by-step examples
    • API Tester
      Test the API live
    • API Documentation
      Complete reference
    Community
    • Blog
      News and updates
    • Service Status
      Uptime and maintenance
    Need help?

    Our team is ready to help you get the most out of PulpoChat

    Go to Help CenterLive chat
  • Pricing
  • Partners
    • Affiliate program
    • Reseller Program
AccessStart for free
Start for freeAccess
  • Pricing
  • Product
    Core Features
    • Multi-user chat
    • Analytics
    • Integrated CRM
    • Access Permissions
    AI and Automation
    • FlowsNew
    • AI AssistantNew
    • BellsNew
    • Automatic Replies
    Platforms
    • Mobile AppNew
    • REST API
    • Integrations
    Use Cases
    • E-commerce
    • Soporte
    • Marketing
  • Resources
    Tools
    • WABA CalculatorNew
    • Link Generator
    • Chat Button
    • Channel Button
    Learning
    • Help Center
    • Tutorials
    • API Tester
    • API Documentation

    Community

    • Blog
    • Service Status
  • Partners
    • Affiliate program
    • Reseller Program
New
Flows are already here! 🤖
Create WhatsApp agents with AI in minutes, without writing code.

Automate workflows and connect over 400 apps in a visual, simple, and hassle-free way

Discover FlowsStart for free
Flows
Popular
You are gonna like it 😎
Quickly send messages to a contact list without programming.

Easy, fast and without using complex tools

Discover CampaignsTry for free
Campañas
New
We have a new mobile app! 📱
Manage your campaigns from anywhere.

Fast, easy and always with you.

Download appMore information
App Móvil
← Back To Flows

AI Agent Bot that understands Text, Audio, Image and Documents

Use for free

Description

Create your Custom Business AI Agent that speaks, sees, listens and replies to your customers.

🚀 What this workflow does

  1. Receives any inbound WhatsApp message via a PulpoChat Trigger
  2. Detects the medium – text, voice note, image or document (PDF)
  3. Processes accordingly
    • Text → straight to the AI brain
    • Voice notes → download ➜ Whisper transcription
    • Images → download ➜ GPT-4o Vision analysis
    • PDFs only → download ➜ text extraction
  4. Feeds the cleaned input + short-term memory buffer (20 turns) to an OpenAI Chat Agent (GPT-4o-mini by default)
  5. Sends the answer back through PulpoChat:
    • If the user sent audio, the bot replies in audio (OpenAI TTS ➜ saves mp3 to Google Drive ➜ returns the public link).
    • Otherwise, returns plain text.
  6. Gracefully rejects anything that isn’t text, image, audio or a PDF (“Sorry, you can only send …”)

Result: a polite, context-aware concierge that can read your contract, describe your cat photo, or summarize a 3-minute rant into a single line—without ever leaving WhatsApp.

🧩 Key components

Node Purpose
PulpoChat Trigger / PulpoChat Receive & send WhatsApp messages
Switch → “Input type” Routes to Text / Audio / Image / Document branches
HTTP Request Securely downloads media from PulpoChat
OpenAI Whisper Turns voice notes into text
GPT-4o Vision Describes images in detail
Extract From File Converts PDFs to text
LangChain Agent Central brain with custom system prompt
Memory Buffer Window Keeps the last 20 turns per chat
OpenAI TTS (“Generate Audio Response”) Converts answers to speech (voice “nova”)
Google Drive (Upload + Delete) Stores the mp3, grabs a share link, cleans up

(Sticky notes in the canvas label the four media lanes so future-you won’t get lost.)

🛠️ Prerequisites

  • PulpoChat device + API key
  • OpenAI API key (chat, whisper, TTS, vision)
  • Google Drive OAuth credentials (for audio replies)

💡 Ideas & extensions

  • Pipe extracted conversation data into HubSpot or Airtable.
  • Replace GPT-4o with your on-prem model ➜ just swap the Chat node.
  • Add a Sentiment node to auto-escalate angry customers.
  • Expand document branch to Word, PowerPoint or spreadsheets.

⚖️ Limits & best-practice nudges

  • Only PDFs are accepted for now; other file types trigger a polite rejection.
  • The workflow rate-limits itself by design (single execution per message), but you may want extra guards if you point it at a large audience.
  • Delete Google Drive files after sending (already included) to keep storage costs clean.
  • Remember WhatsApp’s 24-hour customer-initiated window.

🏁 Ready, set, automate!

Import → Hit Active. Your WhatsApp number just became a futuristic, multimodal AI agent. Enjoy the peace and quiet while it handles the chatter. 😉

Automate Anything on WhatsApp

Non-Code Automation Workflows with 10+ Ready-to-Uuse Templates

Use This Ready-To-Puse Template and try it out for free in minute!

Start automating whatsapp for free

Discover More Templates

  • AI Agent Chatbot with Memory storage
    Abrir plantilla
  • Auto-assign WhatsApp Chats to Departments and Users with AI
    Abrir plantilla
  • AI WhatsApp Agent: Data Training & Smart Customer Support
    Abrir plantilla
  • AI Agent Bot that understands Text, Audio, Image and Documents
    Abrir plantilla
  • WhatsApp Appointments AI Agent with Google Calendar integration
    Abrir plantilla
  • AI Agent with Supabase Datastore
    Abrir plantilla
  • General-purpose WhatsApp AI Agent Support Bot
    Abrir plantilla
  • AI Agent Rerank Cohere
    Abrir plantilla
  • How to build a WhatsApp Group Moderator with AI
    Abrir plantilla
  • WhatsApp + Hubspot Automation (CRM)
    Abrir plantilla
  • WhatsApp + Slack Automation
    Abrir plantilla
  • WhatsApp Bot that understands Text, Audio, Images and PDFs
    Abrir plantilla
  • WhatsApp Group AI Moderator
    Abrir plantilla
  • WhatsApp Shopify Integration
    Abrir plantilla
  • Publish latest YouTube videos on a WhatsApp Channel
    Abrir plantilla
logo

The complete WhatsApp communication solution for teams and companies.

Get started in minutes with a free 7-day trial. No credit card.

DemoFree trial
Product
  • Features
  • Use cases
  • FlowsNew
  • Mobile applicationNew
  • BellsNew
  • WhatsApp Business AppNew
  • Multi-user chat
  • Pricing
  • Questions
Resources
  • WABA Pricing CalculatorNew
  • Help Center
  • Tutorials
  • API Documentation
  • WhatsApp link generator
  • Chat button in WhatsApp
Information
  • Affiliate program
  • Reseller Program
  • Service status
  • Blog
  • Contact

© 2026 - PulpoChat

  • Terms of use
  • ·
  • Privacy