
🖥️ Sources & Formats in Outmind

Written by Nicolas Movio
Updated over 3 weeks ago

🖥️ Indexable Sources

Outmind can automatically index your company's data from a wide range of platforms:

| Source | Items Indexed |
| --- | --- |
| Teams | Messages, Files (hosted on OneDrive or SharePoint) |
| OneDrive | Files |
| Outlook | Emails, Attachments |
| SharePoint | Files, Emails, Pages |
| Box | Files |
| Dropbox | Files |
| Google Drive | Files |
| Gmail | Emails, Attachments |
| iManage | Files, Emails, Attachments |
| Local Drive | Files |
| Network Drive | Files |
| Notion | Pages |
| Slack | Messages, Files |


📄 Types of Documents Indexable in Outmind

  • Files

  • Emails

  • Messages


🔍 Available Format Filters

Text Files

pdf, aspx, doc, dot, rtf, gdoc, docm, dotm, odt, docx, dotx, wri, markdn, markdown, md, mdown, txt, text, conf, def, list, log, in, ini, mkd

Presentations

gslides, ppt, pps, pot, pptm, odp, pptx, ppsx

Spreadsheets

gsheet, dis, xls, xlm, xla, xlc, xlt, xlw, xlsb, xlsm, xltm, ods, xlsx, xltx, xml, csv, tsv

Images

gif, heic, jpeg, jpg, jpe, png, svg, svgz, tif, tiff, webp, ppm, tga, xpm

Videos

mts, mp4, mp4v, mpg4, mpeg, mpg, mpe, m1v, m2v, wm, avi

Audio

au, snd, mp4a, oga, ogg, spx, opus, wma

Email Archives

msg, pst, eml, mime

Compressed Files

gz, rar, 7z, bz2, boz, zip

3D / CAD Plans and Models

skp, obj, dwg, dxf, 3ds, igs, iges, dwf, dgn, ifc, rvt, rfa, rte, rft, shp, prjv, dxb, nwd, nwf, nwc, fbx, ige, 3dm, sat
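The filter categories above amount to a lookup from file extension to category. Here is a minimal sketch of that lookup in Python, assuming the extension lists are copied verbatim from this page. The names `FORMAT_FILTERS`, `EXTENSION_TO_FILTER` and `filter_for` are hypothetical and are not part of Outmind, and matching on the last extension only, case-insensitively, is an assumption.

```python
from pathlib import PurePath

# Extension lists copied from the filter categories above.
FORMAT_FILTERS = {
    "Text Files": "pdf aspx doc dot rtf gdoc docm dotm odt docx dotx wri "
                  "markdn markdown md mdown txt text conf def list log in ini mkd",
    "Presentations": "gslides ppt pps pot pptm odp pptx ppsx",
    "Spreadsheets": "gsheet dis xls xlm xla xlc xlt xlw xlsb xlsm xltm ods "
                    "xlsx xltx xml csv tsv",
    "Images": "gif heic jpeg jpg jpe png svg svgz tif tiff webp ppm tga xpm",
    "Videos": "mts mp4 mp4v mpg4 mpeg mpg mpe m1v m2v wm avi",
    "Audio": "au snd mp4a oga ogg spx opus wma",
    "Email Archives": "msg pst eml mime",
    "Compressed Files": "gz rar 7z bz2 boz zip",
    "3D / CAD Plans and Models": "skp obj dwg dxf 3ds igs iges dwf dgn ifc rvt "
                                 "rfa rte rft shp prjv dxb nwd nwf nwc fbx ige 3dm sat",
}

# Invert the table into extension -> category for direct lookups.
EXTENSION_TO_FILTER = {
    ext: category
    for category, extensions in FORMAT_FILTERS.items()
    for ext in extensions.split()
}


def filter_for(filename):
    """Return the filter category for a file name, or None if it is unlisted.

    Assumption: only the last extension counts and matching ignores case.
    """
    ext = PurePath(filename).suffix.lstrip(".").lower()
    return EXTENSION_TO_FILTER.get(ext)
```

For example, `filter_for("deck.pptx")` falls under Presentations, while a name with no extension matches no filter and returns `None`.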


🧩 Formats Indexed for Metadata Only

All files are indexed for their metadata, except those in the following excluded categories:

System Files (Disk)

  • Hidden files (. prefix on Unix/Mac)

  • Temporary Office files (~$, ~*.tmp)

  • Cloud placeholder files (OneDrive files not locally downloaded)

Media Files in Email Attachments

  • Images, Audio, Videos (e.g., email signature logos)

Excluded Outlook Folders

  • IPF, IPF.Appointment, IPF.Configuration, IPF.Contact, IPF.StoreItem.EventCheckPoints, IPF.Task
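The disk and Outlook exclusions above can be sketched as simple name checks. This is an illustration, not Outmind's implementation. The function names are hypothetical. Reading `~$` as a name prefix and `~*.tmp` as a glob pattern is an assumption, as is exact matching on the folder class. Whether a file is a cloud placeholder cannot be told from its name, so it is passed in as a flag.

```python
from fnmatch import fnmatchcase

# Folder classes copied from the excluded Outlook folders listed above.
EXCLUDED_OUTLOOK_FOLDER_CLASSES = {
    "IPF",
    "IPF.Appointment",
    "IPF.Configuration",
    "IPF.Contact",
    "IPF.StoreItem.EventCheckPoints",
    "IPF.Task",
}


def is_excluded_disk_file(name, is_cloud_placeholder=False):
    """Return True if a file on disk falls under the system-file exclusions."""
    # Hidden files: "." prefix on Unix/Mac.
    if name.startswith("."):
        return True
    # Temporary Office files: "~$" prefix, or "~*.tmp" read as a glob (assumption).
    if name.startswith("~$") or fnmatchcase(name.lower(), "~*.tmp"):
        return True
    # Cloud placeholders (OneDrive files not downloaded locally) must be
    # detected by the caller; the name alone does not reveal them.
    return is_cloud_placeholder


def is_excluded_outlook_folder(folder_class):
    """Return True if the folder class is in the excluded list (exact match, assumed)."""
    return folder_class in EXCLUDED_OUTLOOK_FOLDER_CLASSES
```

Exact matching is used for folder classes because the bare `IPF` entry would otherwise exclude every folder whose class starts with that prefix.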


📚 Formats Indexed for Content

Text Documents

doc, dot, docx, dotx, docm, odt, txt, text, md, markdown, markdn, mdown, mkd, html, htm, shtml, rtf, conf, def, list, log, in, ini, aspx

Spreadsheets

xls, xlm, xla, xlc, xlt, xlw, xlsx, xlsb, xlsm, xltx, csv, ods

Presentations

ppt, pps, pot, pptx, ppsx, pptm, odp

PDFs

pdf

Google Documents

gdoc, gslides, gsheet

Email Archives

msg, pst, eml, mime

Other

rar, zip, vsd, vst, vss, vsw (Visio)


👁️‍🗨️ Formats Processed by OCR

OCR (Optical Character Recognition) is applied using AWS Textract when both of the following criteria are met:

  • The file is a PDF

  • The page does not already contain text
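The two criteria above can be read as a per-page decision. The sketch below shows only that decision, assuming the text already extracted from each page is available as a string. It does not call AWS Textract. The function names are hypothetical, and treating a whitespace-only page as having no text is an assumption.

```python
def needs_ocr(filename, page_text):
    """Return True if a page meets both OCR criteria listed above."""
    # Criterion 1: the file is a PDF.
    is_pdf = filename.lower().endswith(".pdf")
    # Criterion 2: the page does not already contain text.
    # Assumption: a page holding only whitespace counts as having no text.
    has_text = bool(page_text and page_text.strip())
    return is_pdf and not has_text


def pages_needing_ocr(filename, page_texts):
    """Return the 1-based numbers of the pages that would be sent to OCR."""
    return [
        number
        for number, text in enumerate(page_texts, start=1)
        if needs_ocr(filename, text)
    ]
```

With a scanned PDF whose first page has a text layer and whose next two pages do not, `pages_needing_ocr("scan.pdf", ["Intro", "", "  \n"])` selects pages 2 and 3.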


💬 Formats Accessible by the AI Assistant

Once indexed, files become accessible to the AI Assistant for reading and reasoning:

doc, dot, docx, dotx, docm, csv, xlsx, xml, pdf, ppt, pps, pot, pptx, ppsx, pptm, odp, gdoc, gslides, vsd, vst, vss, vsw, txt, md, html, aspx, wri, conf, def, list, log, in, ini, gif, jpeg, jpg, png, tar, zip, js, json, css, c, cpp, java, php, ts, tex

(Note: Newly connected sources may require a short delay before the assistant can access their content.)


👀 Formats Viewable Directly in Outmind

Files

html, htm, shtml, xhtml, xml, xsl, xsd, xslt, svg, svgz, pdf, doc, dot, docx, dotx, docm, ppt, pps, pot, pptx, ppsx, pptm, odp, gdoc, gslides, vsd, vst, vss, vsw, xls, xlm, xla, xlc, xlt, xlw, xlsb, xlsm, xlsx, xltx, csv, gsheet, md, markdown, markdn, mdown, mkd, txt, text, conf, def, list, log, in, ini

Images

gif, jpeg, jpg, jpe, png

Email Archives

msg, eml


📑 Formats Generating Dedicated Pages (Pages Tab)

doc, dot, docx, dotx, docm, ppt, pps, pot, pptx, ppsx, pptm, odp, pdf, gdoc, gslides, vsd, vst, vss, vsw


📦 Archive Formats Whose Content Is Extracted and Indexed

zip, rar, pst, eml, msg
