Skip to main content

Uploading and Managing Files

Upload PDF files to Trustana and let AI automatically process them for data extraction. Learn file requirements and best practices.

Upload PDF files to Trustana and let our AI automatically extract product data. This guide covers how to upload files and what happens behind the scenes.


What Are Files in Trustana?

Files (PDFs) are product documents like spec sheets, catalogs, or data sheets that contain valuable product information. When you upload a file, Trustana:

  1. Reads the content using advanced OCR technology

  2. Finds product identifiers (SKU, Product Model) on each page

  3. Matches pages to your products automatically

Once processed, you can extract product data from these files using AI.


Step 1: Choose Where to Upload

You can upload files from three different places in Trustana:

Option A: From a Product Detail Page

  1. Go to any product

  2. Click the Files tab

  3. Upload files directly linked to that product

Option B: From Digital Assets > All Files

  1. Click Digital Assets in the menu

  2. Select All Files

  3. Upload files that may contain multiple products

Option C: From Bulk Media Upload

  1. Select products in the All Products table

  2. Click Bulk Media Upload

  3. Select a product in the left panel

  4. Click the Files tab on the right panel


Step 2: Upload Your Files

  1. Click Upload Files

  2. A modal window appears

  3. Drag and drop your PDF files, or click to browse

  4. Click Upload file to start the upload

Tip: You can upload multiple files at once.

Trustana de-duplicates uploaded files by content signature, not filename. Re-uploading an identical PDF will not create a second copy.


Step 3: Wait for Processing

After upload, Trustana automatically processes your files:

What Happens

Description

Content Extraction

AI reads all text and tables from each page

Identifier Detection

System finds SKUs and Product Models

Product Matching

Pages are linked to matching products in your account

Note: Processing typically takes 1–3 minutes for small files, and 10–20 minutes for files with 5 or more pages.

You'll receive an email notification when processing is complete. Wait for this notification before starting an extraction task β€” products in a file that has not finished processing will be excluded from the task.

Once processed, you can download any file individually from Digital Assets > All Files or a product's Files tab.


File Requirements

Requirement

Limit

File format

PDF only (.pdf)

Maximum pages

50 pages per file

Maximum file size

20 MB per file

File must be

Unlocked (not password-protected)

PDFs in multiple languages are accepted.


Tips for Preparing Files

To get the most accurate data extraction:

  1. Use clean, well-structured files

    • Tables with clear headers work best

    • Avoid files with heavily merged or nested cells

  2. Make the identifier visible on every data page

    • The SKU or Product Model must appear on every page that contains data you want extracted

    • Pages without an identifier match are skipped during extraction

  3. Check your identifier format

    • The SKU or Product Model in Trustana must match exactly how it appears in the PDF

    • Copying a product name into the Product Model field does not work

  4. One-page product spec sheets are ideal

    • Single product per page = highest accuracy

    • Multi-product tables also work well

  5. Avoid merging pages into one long page

    • Do not merge many PDF pages into a single tall page β€” this impacts processing time, cost, and extraction quality


What's next

Did this answer your question?