Sources
Your original data—uploaded files—that Shadowfax never modifies.
What are Sources?
Sources are your raw data imports. Currently, you can upload CSV, Excel, and Parquet files. Sources represent the starting point for all analysis. Shadowfax treats Sources as immutable—they're read-only and never changed by transformations.
Drag and drop files to import data
Why Sources Matter
Immutability: Your original data stays pristine. All transformations create new Views—never modify Sources.
Flexibility: Combine multiple file uploads in the same Workbook. Database connections coming soon.
Context: Add descriptions and annotations so the AI understands your data better.
Traceability: Always see where your analysis started. Every View traces back to Sources.
How to Import Data
Uploading Files
- Click "Add Source" or the + icon in your Workbook
- Drag and drop files or click to browse
- Supported formats: CSV, TSV, Excel (.xlsx), Parquet
- Preview your data before confirming
- Watch the upload progress bar
Drag and drop files for instant upload
Connecting to Databases (Coming Soon)
We're building support for live database connections. You'll soon be able to:
- Connect to databases like Snowflake, BigQuery, and others
- Import tables or views directly
- Set automatic refresh schedules
- Keep your data current without manual re-uploads
More details coming as this feature launches
Adding Context and Metadata
After importing, help the AI understand your data:
Dataset Description: Let the AI generate an automatic description, or write your own. This helps the AI know what questions to answer.
Column Context: Add notes explaining what columns mean, especially for:
- Abbreviations or codes (e.g., "Status codes: 1=Active, 2=Suspended")
- Similar column names (e.g., "ship_date is when we sent it, delivery_date is when customer received it")
- Business rules (e.g., "discount_pct should always be between 0 and 100")
- Data quality notes (e.g., "created_date is sometimes NULL for legacy records")
Data Patterns: Specify formats the AI might not recognize automatically (e.g., "dates are in DD/MM/YYYY format")
Add column-level annotations to guide the AI
Common Use Cases
- Upload spreadsheets: Drop your Excel reports directly into Shadowfax
- Combine datasets: Mix multiple uploaded files in one analysis
- One-time exploration: Upload CSV exports for quick ad-hoc analysis
Tips & Best Practices
Name your Sources clearly: "orders_2024.csv" is better than "export_v3_final.csv"
Add context immediately: Spend 30 seconds describing your data. It saves hours of back-and-forth with the AI.
Preview before importing: Check that column names and data types look correct.
Use column annotations: If two columns have similar names, explain the difference. The AI will use this context in all future queries.
Start with clean data when possible: While Shadowfax can handle messy data, cleaner Sources lead to faster results.
Understanding Your Data
After import, use the Profile view to see:
- Row and column counts
- Data types per column
- Summary statistics
- Null value percentages
- Sample values
Click the "Profile" toggle at the bottom panel to inspect metadata.
Profile view shows column statistics and data quality info
Related Features
- Workbooks - Where your Sources live
- Views - Transform Sources without changing them
- Data Refresh - Keep live data current
- Schema Discovery - How Shadowfax understands relationships