Alloovium

Document Hub

Document Hub

Upload, organise, and manage your documents securely in projects. Projects are the foundation of everything in Alloovium.

Overview

The Document Hub is the central repository for all files in a project. Every document you upload is automatically processed — text is extracted via OCR if needed, chunked into semantically meaningful segments, and embedded for fast AI retrieval.

Once a document is processed, it is immediately available for querying in the AI Assistant, template filling, document generation, and use in analysis tools like Change Impact.

Document Hub view showing a list of uploaded documents with processing status

Uploading Documents

You can upload documents to a project in several ways: by dragging files directly onto the upload area, by clicking the "Upload" button and selecting files from your computer, or by syncing from a connected cloud storage provider (SharePoint, Google Drive, Procore).

  1. 1

    Open the project

    Navigate to Projects in the sidebar and open the project you want to upload into.

  2. 2

    Upload your files

    Drag and drop files onto the upload area, or click "Upload" and select files from your computer. You can upload multiple files at once.

  3. 3

    Wait for processing

    Alloovium processes each document automatically. You will see a progress indicator next to each file. Processing typically takes 30–120 seconds depending on file size.

  4. 4

    Start querying

    Once processing is complete, the document status changes to "Ready". You can now query it via the AI Assistant.

File upload area with drag-and-drop zone and a list of files being processed

Batch uploads

You can upload up to 50 files at once. Large batches are queued and processed in parallel — you do not need to wait for one batch to finish before uploading another.

Supported Formats

FormatExtensionNotes
PDF.pdfIncludes scanned PDFs — OCR is applied automatically
Word.docxTracked changes and comments are included
Excel.xlsx, .xlsAll sheets are processed; tables extracted
PowerPoint.pptxSlide text and speaker notes extracted
Images.png, .jpg, .tiffOCR applied; useful for scanned drawings
Plain text.txt, .csvProcessed directly; no OCR required

Managing Projects

Documents are always organised inside a project. A project is a workspace that holds a set of related documents — for example, all documents related to a specific contract or tender. You can create as many projects as you need and invite different team members to each one.

Within a project, you can create folders to further organise your documents. Folders are optional — all documents in a project are available for querying regardless of which folder they are in.

Project view with folder tree on the left and document list on the right

Storage limits

Storage limits depend on your subscription plan. Contact support if you need to increase your storage quota.

AI Organizer & Auto-Sort

If you have a loose batch of mixed files with no folder structure, use the AI Organizer from the main Projects page. It previews a project-and-folder layout before anything is created, so you can sort a messy intake of specifications, drawings, registers, memos, standards, and instructions in one pass.

  1. 1

    Open Projects and start AI Organizer

    From the main Projects page, click "AI Organizer". This flow is designed for unsorted batches that are not yet inside a project.

  2. 2

    Upload the loose files

    Drop in the whole intake at once. The organizer reads filenames, paths, and extracted content to infer likely projects and document types.

  3. 3

    Review the proposed structure

    Alloovium shows the suggested projects and target folders before upload. Low-confidence files are flagged so they can be checked manually.

Loose file typeSuggested folderTypical examples
Specifications and standards03 Specifications & Technical / 03.01 SpecificationsSpecifications, scopes, standards, technical requirements, manuals
Drawings and issued sheets02 Drawings & Models / 02.01 Sheets & Issued DrawingsDrawings, plans, elevations, details, issued sketches
Registers and logs06 Controls & Schedules / 06.02 Registers & LogsRegisters, trackers, logs, schedules, look-aheads
Memos and correspondence07 Meetings & Correspondence / 07.02 Correspondence & EmailsMemos, letters, general correspondence, email exports
Instructions and notices08 Changes & Instructions / 08.02 Instructions & NoticesSite instructions, architect instructions, notices, formal directions
AI organizer preview showing suggested projects and destination folders before upload

Use the whole batch

The organizer works best when you upload the full unsorted intake together. That gives it enough context to separate projects and choose the right folder paths.

Review safeguards

Low-confidence files are marked for review instead of being silently moved into a folder you did not expect.

Document Types

Alloovium automatically classifies uploaded documents into categories to help with organisation and to optimise analysis. Common types include contracts, specifications, drawings, reports, and correspondence. You can also assign a custom type to any document.

TypeDescriptionBest used for
ContractLegal agreements and subcontractsPayment terms, obligations, liquidated damages
SpecificationTechnical and performance specsScope verification, compliance checking
DrawingEngineering and architectural drawingsClash detection, revision tracking
ReportSite reports, inspection reportsIssue tracking, defect management
CorrespondenceEmails, RFIs, instructionsNotice tracking, variation history

Processing Pipeline

Understanding how Alloovium processes your documents helps you get the most out of the platform. Each document goes through the following stages:

  1. Text extractionAlloovium reads the raw text from each page. For scanned PDFs and images, GPU-accelerated OCR is applied automatically.
  2. Layout analysisThe document structure is analysed: headings, tables, figures, and paragraph blocks are identified and tagged.
  3. ChunkingThe document is split into semantically meaningful segments (typically 200–500 tokens each) that are small enough for precise retrieval.
  4. EmbeddingEach chunk is converted into a vector embedding using a high-dimensional model trained on technical and legal language.
  5. IndexingEmbeddings are stored in a vector index for fast similarity search, enabling sub-second retrieval during AI queries.

GPU extraction

Alloovium uses GPU-accelerated document extraction for complex PDFs and engineering drawings. This means higher accuracy on scanned documents, tables with merged cells, and multi-column layouts.