InsiteChat document upload lets you train your AI chatbot on files from your existing content library: PDFs (with OCR for scanned documents), Word documents, PowerPoint decks, Excel sheets, CSVs, Markdown, and plain text. InsiteChat extracts the text and indexes it alongside any other sources you’ve added, such as website crawls, Google Drive, Notion, and Dropbox.
Supported file formats
| Format | Extensions |
|---|---|
| PDF | .pdf |
| Word document | .doc, .docx |
| PowerPoint | .ppt, .pptx |
| Plain text | .txt |
| Markdown | .md |
| CSV | .csv |
| YouTube transcript | Paste a YouTube URL |
| Zendesk articles | Connect via integration |
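Before uploading a batch of files, it can help to check their extensions against the table above. The following is a minimal sketch rather than part of InsiteChat: `SUPPORTED_EXTENSIONS` simply copies the file extensions listed in the table, and `split_by_support` is a hypothetical helper name. It inspects file names only and does not call any InsiteChat API.

```python
from pathlib import Path

# Extensions copied from the table above (an assumption that the table is
# complete); adjust this set if the supported list changes.
SUPPORTED_EXTENSIONS = {".pdf", ".doc", ".docx", ".ppt", ".pptx", ".txt", ".md", ".csv"}


def split_by_support(paths):
    """Partition file paths into (supported, unsupported) by extension.

    Hypothetical pre-upload helper: it looks at file names only.
    """
    supported, unsupported = [], []
    for raw in paths:
        path = Path(raw)
        # Compare case-insensitively so "REPORT.PDF" matches ".pdf".
        if path.suffix.lower() in SUPPORTED_EXTENSIONS:
            supported.append(path)
        else:
            unsupported.append(path)
    return supported, unsupported
```

Calling `split_by_support(["guide.pdf", "notes.MD", "logo.png"])` would place the first two paths in the supported list and `logo.png` in the unsupported one.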
Upload a document
Best practices for document quality
The quality of your chatbot’s answers depends heavily on the quality of the documents you upload. Follow these guidelines to get the best results:
- Use clear headings: Structure documents with descriptive H1, H2, and H3 headings to help InsiteChat organize the content.
- Avoid scanned images without OCR: InsiteChat indexes the text it extracts from a document. Scanned PDFs whose text exists only as images are harder to index reliably; use files with selectable text or run OCR first.
- Keep formatting clean: Complex tables and multi-column layouts reduce extraction accuracy. Simplify them or convert the content to plain text or Markdown before uploading.
- One topic per document: Focused documents are easier for the AI to analyze. Split large multi-topic documents into separate files.
- Check for accuracy: Remove outdated information, correct errors, and delete obsolete sections before uploading.
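The "one topic per document" guideline can be sketched for Markdown sources as follows. This is an illustration under stated assumptions, not an InsiteChat feature: it assumes the document uses `#` and `##` headings to mark topics, treats each such heading as the start of a new section, and uses the hypothetical function name `split_markdown_by_topic`. Splitting at H1/H2 is a choice made for this example; deeper headings stay inside their parent section.

```python
import re

# Matches a level-1 or level-2 Markdown heading such as "# Title" or "## Title".
HEADING = re.compile(r"^(#{1,2})\s+(.+?)\s*$")
# Marker that opens or closes a fenced code block.
FENCE = "`" * 3


def split_markdown_by_topic(text):
    """Split Markdown text into (title, body) sections at H1/H2 headings.

    Text before the first heading is returned under the title "Introduction".
    Heading-like lines inside fenced code blocks are ignored, and sections
    with an empty body are dropped.
    """
    sections = []
    title, body = "Introduction", []
    in_fence = False

    def flush():
        # Store the section collected so far, skipping empty ones.
        content = "\n".join(body).strip()
        if content:
            sections.append((title, content))

    for line in text.splitlines():
        if line.lstrip().startswith(FENCE):
            in_fence = not in_fence
        match = None if in_fence else HEADING.match(line)
        if match:
            flush()
            title, body = match.group(2), []
        else:
            body.append(line)
    flush()
    return sections
```

Each returned `(title, body)` pair could then be written to its own file and uploaded separately.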
