
CSV Import

Upload and enrich company data from CSV files

Import your existing company or contact lists via CSV and let Linkt's AI agents enrich them with additional data points. This guide walks through the complete workflow from file upload to retrieving enriched entities.

Prerequisites

Before starting, ensure you have:

  • A Linkt account with API access
  • Your API key (see Authentication)
  • A CSV file with company or contact data

CSV Format Requirements

Your CSV file should follow these guidelines:

Required

  • Header row — First row must contain column names
  • Primary column — At least one column with entity names (company names or person names)
  • UTF-8 encoding — Ensure proper character encoding

For company data:

  • Company name (required as primary column)
  • Website/domain
  • Industry
  • Location
  • Employee count

For person data:

  • Full name (required as primary column)
  • Company name
  • Job title
  • Email
  • LinkedIn URL

Example CSV

company_name,website,industry,location
Acme Corporation,acme.com,Software,San Francisco
TechStartup Inc,techstartup.io,SaaS,New York
GlobalCorp,globalcorp.com,Manufacturing,Chicago
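Before uploading, you can sanity-check a file against these requirements locally. A minimal sketch using Python's standard csv module — `validate_csv` is an illustrative helper, not part of the Linkt API:

```python
import csv

def validate_csv(path, primary_column):
    """Pre-flight check: UTF-8 encoding, header row, and primary column present.
    Returns the number of data rows."""
    # open() raises UnicodeDecodeError if the file is not valid UTF-8
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        if not reader.fieldnames:
            raise ValueError("CSV has no header row")
        if primary_column not in reader.fieldnames:
            raise ValueError(
                f"Primary column {primary_column!r} not in header: {reader.fieldnames}"
            )
        return sum(1 for _ in reader)

# validate_csv("companies.csv", "company_name")
```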

Step 1: Upload Your CSV

Upload your CSV file to get a file_id for the ingest task.

curl -X POST "https://api.linkt.ai/v1/files/upload" \
  -H "x-api-key: your-api-key" \
  -F "file=@companies.csv"

Response:

{
  "file_id": "507f1f77bcf86cd799439001",
  "name": "companies.csv",
  "content_type": "text/csv",
  "size_bytes": 2048,
  "csv_metadata": {
    "row_count": 150,
    "columns": ["company_name", "website", "industry", "location"],
    "preview_rows": [
      {"company_name": "Acme Corporation", "website": "acme.com", "industry": "Software", "location": "San Francisco"}
    ],
    "encoding": "utf-8"
  },
  "processing_status": "completed"
}
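It's worth verifying the upload response before moving on to the ingest task. A small sketch using the field names from the example response above — `check_upload` is an illustrative helper, not an API call:

```python
def check_upload(file_data, primary_column):
    """Confirm the file was processed and the primary column exists.
    Field names mirror the example upload response; returns the file_id."""
    if file_data.get("processing_status") != "completed":
        raise RuntimeError(f"File not processed: {file_data.get('processing_status')}")
    columns = file_data["csv_metadata"]["columns"]
    if primary_column not in columns:
        raise ValueError(f"{primary_column!r} not found; available columns: {columns}")
    return file_data["file_id"]
```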

Step 2: Create an Enrichment ICP

Create an ICP that defines what data to research and add to your entities. For CSV import, the entity target description should focus on enrichment fields rather than search criteria.

curl -X POST "https://api.linkt.ai/v1/icp" \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Company Enrichment",
    "description": "Enrich imported company data with additional fields",
    "mode": "discovery",
    "entity_targets": [
      {
        "entity_type": "company",
        "description": "## Enrichment Fields\n- primary_product: The main product or service offered\n- tech_stack: Key technologies and platforms used\n- recent_funding: Latest funding round and amount\n- employee_growth: Year-over-year headcount change",
        "root": true
      }
    ]
  }'

Response:

{
  "id": "507f1f77bcf86cd799439002",
  "name": "Company Enrichment",
  "mode": "discovery",
  "entity_targets": [...],
  "created_at": "2025-01-06T10:00:00Z"
}

Save the id — you'll need it when executing the task.

Step 3: Create a Sheet

Create a sheet to store the enriched entities. The entity_type should match the type of data in your CSV.

curl -X POST "https://api.linkt.ai/v1/sheet" \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Q1 Imported Companies",
    "icp_id": "507f1f77bcf86cd799439002",
    "entity_type": "company"
  }'

Response:

{
  "id": "507f1f77bcf86cd799439003",
  "name": "Q1 Imported Companies",
  "icp_id": "507f1f77bcf86cd799439002",
  "entity_type": "company",
  "created_at": "2025-01-06T10:01:00Z"
}

Step 4: Configure the Ingest Task

Create an ingest task that references your uploaded file. The task configuration requires:

  • file_id — The ID from step 1
  • primary_column — Column name containing entity names to match
  • csv_entity_type — Entity type in the CSV (company or person)

curl -X POST "https://api.linkt.ai/v1/task" \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "Import Q1 Companies",
    "description": "Import and enrich 150 companies from Q1 leads list",
    "flow_name": "ingest",
    "deployment_name": "ingest/v1",
    "sheet_id": "507f1f77bcf86cd799439003",
    "task_config": {
      "version": "v1.0",
      "config_type": "ingest-task",
      "file_id": "507f1f77bcf86cd799439001",
      "primary_column": "company_name",
      "csv_entity_type": "company"
    }
  }'

Response:

{
  "id": "507f1f77bcf86cd799439004",
  "name": "Import Q1 Companies",
  "flow_name": "ingest",
  "task_config": {
    "version": "v1.0",
    "config_type": "ingest-task",
    "file_id": "507f1f77bcf86cd799439001",
    "primary_column": "company_name",
    "csv_entity_type": "company"
  },
  "created_at": "2025-01-06T10:02:00Z"
}

Task Configuration Fields

  • file_id (required) — MongoDB ObjectId of the uploaded file
  • primary_column (required) — Column name containing entity names
  • csv_entity_type (required) — Entity type: company or person
  • version (required) — Config version (use v1.0)
  • config_type (required) — Must be ingest-task

Step 5: Execute and Monitor

Execute the task to start the import and enrichment process.

curl -X POST "https://api.linkt.ai/v1/task/507f1f77bcf86cd799439004/execute" \
  -H "x-api-key: your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "icp_id": "507f1f77bcf86cd799439002"
  }'

Response:

{
  "run_id": "507f1f77bcf86cd799439005",
  "flow_run_id": "550e8400-e29b-41d4-a716-446655440000",
  "status": "SCHEDULED"
}

The run will progress through the states SCHEDULED → PENDING → RUNNING → COMPLETED. Poll the run endpoint until it reaches a terminal state.

Monitor Processing Queue

For large imports, you can monitor the processing queue:

curl -X GET "https://api.linkt.ai/v1/run/507f1f77bcf86cd799439005/queue" \
  -H "x-api-key: your-api-key"

Queue states:

  • queued — Waiting to be processed
  • processing — Currently being enriched
  • completed — Successfully imported
  • discarded — Skipped (entity not found or not qualified)
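Tallying items by state gives a quick progress overview during a large import. A sketch, assuming the queue endpoint returns items that each carry a state field — the exact payload shape is an assumption, so adjust to the actual response:

```python
from collections import Counter

def summarize_queue(items):
    """Tally queue items by state (queued / processing / completed / discarded).

    Assumes each item carries a "state" field -- an assumption about the
    queue payload, adjust as needed.
    """
    return Counter(item.get("state") for item in items)

# Usage (illustrative; assumes the endpoint returns a JSON list of items):
# items = requests.get(f"{BASE_URL}/run/{run_id}/queue", headers=HEADERS).json()
# print(summarize_queue(items))
```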

Step 6: Review Results

Once the run completes, retrieve the enriched entities from your sheet.

curl -X GET "https://api.linkt.ai/v1/sheet/507f1f77bcf86cd799439003/entities" \
  -H "x-api-key: your-api-key"

Response:

{
  "entities": [
    {
      "id": "507f1f77bcf86cd799439010",
      "sheet_id": "507f1f77bcf86cd799439003",
      "data": {
        "name": {
          "value": "Acme Corporation",
          "references": ["https://acme.com"]
        },
        "website": {
          "value": "https://acme.com",
          "references": []
        },
        "primary_product": {
          "value": "Enterprise CRM software for mid-market companies",
          "references": ["https://acme.com/products"]
        },
        "tech_stack": {
          "value": "React, Node.js, PostgreSQL, AWS",
          "references": ["https://stackshare.io/acme"]
        },
        "recent_funding": {
          "value": "Series B - $45M (January 2025)",
          "references": ["https://techcrunch.com/acme-series-b"]
        }
      },
      "created_at": "2025-01-06T10:15:00Z"
    }
  ],
  "total": 150,
  "page": 1,
  "page_size": 20
}
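The per-field {"value": ..., "references": [...]} structure can be flattened for export back to CSV. A sketch based on the response shape above — `entities_to_rows` and `export_csv` are illustrative helpers:

```python
import csv

def entities_to_rows(entities):
    """Flatten the {field: {"value": ..., "references": [...]}} cells
    shown in the response above into plain dicts."""
    return [
        {field: cell.get("value") for field, cell in entity.get("data", {}).items()}
        for entity in entities
    ]

def export_csv(entities, path):
    """Write flattened entities to a CSV file, one column per field."""
    rows = entities_to_rows(entities)
    if not rows:
        return
    fieldnames = sorted({key for row in rows for key in row})
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)
```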

Complete Example

Here's a complete Python script for CSV import:

import requests
import time
 
API_KEY = "your-api-key"
BASE_URL = "https://api.linkt.ai/v1"
HEADERS = {"x-api-key": API_KEY}
 
def csv_import(file_path, primary_column, entity_type="company"):
    """Import and enrich a CSV file."""
 
    # Step 1: Upload CSV
    print("Uploading CSV...")
    with open(file_path, "rb") as f:
        response = requests.post(
            f"{BASE_URL}/files/upload",
            headers=HEADERS,
            files={"file": f}
        )
    file_data = response.json()
    file_id = file_data["file_id"]
    print(f"Uploaded: {file_data['csv_metadata']['row_count']} rows")
 
    # Step 2: Create ICP
    print("Creating ICP...")
    icp_response = requests.post(
        f"{BASE_URL}/icp",
        headers={**HEADERS, "Content-Type": "application/json"},
        json={
            "name": f"Enrichment - {file_path}",
            "mode": "discovery",
            "entity_targets": [{
                "entity_type": entity_type,
                "description": "## Enrichment Fields\n- industry: Industry sector\n- description: Company description\n- employee_count: Number of employees",
                "root": True
            }]
        }
    )
    icp_id = icp_response.json()["id"]
 
    # Step 3: Create Sheet
    print("Creating sheet...")
    sheet_response = requests.post(
        f"{BASE_URL}/sheet",
        headers={**HEADERS, "Content-Type": "application/json"},
        json={
            "name": f"Import - {file_path}",
            "icp_id": icp_id,
            "entity_type": entity_type
        }
    )
    sheet_id = sheet_response.json()["id"]
 
    # Step 4: Create Task
    print("Creating ingest task...")
    task_response = requests.post(
        f"{BASE_URL}/task",
        headers={**HEADERS, "Content-Type": "application/json"},
        json={
            "name": f"Import {file_path}",
            "description": "CSV import and enrichment",
            "flow_name": "ingest",
            "deployment_name": "ingest/v1",
            "sheet_id": sheet_id,
            "task_config": {
                "version": "v1.0",
                "config_type": "ingest-task",
                "file_id": file_id,
                "primary_column": primary_column,
                "csv_entity_type": entity_type
            }
        }
    )
    task_id = task_response.json()["id"]
 
    # Step 5: Execute
    print("Executing...")
    run_response = requests.post(
        f"{BASE_URL}/task/{task_id}/execute",
        headers={**HEADERS, "Content-Type": "application/json"},
        json={"icp_id": icp_id}
    )
    run_id = run_response.json()["run_id"]
 
    # Poll for completion
    while True:
        run = requests.get(
            f"{BASE_URL}/run/{run_id}",
            headers=HEADERS
        ).json()
 
        print(f"  Status: {run['status']}")
 
        if run["status"] == "COMPLETED":
            break
        elif run["status"] in ["FAILED", "CANCELED", "CRASHED"]:
            raise Exception(f"Import failed: {run.get('error')}")
 
        time.sleep(10)  # Poll every 10 seconds
 
    # Step 6: Get results
    entities = requests.get(
        f"{BASE_URL}/sheet/{sheet_id}/entities",
        headers=HEADERS
    ).json()
 
    print(f"Success! Imported {entities['total']} entities")
    return entities
 
# Run import
result = csv_import("companies.csv", "company_name", "company")

Best Practices

Data Quality

  • Clean your data — Remove duplicates and fix obvious errors before upload
  • Consistent naming — Use consistent company name formats
  • Include domains — Website/domain helps with entity matching

Performance

  • Batch size — For large files (1000+ rows), consider splitting into smaller batches
  • Timeout — Allow sufficient time for enrichment (larger files take longer)
  • Monitor queue — Use the queue endpoint to track progress
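Splitting can be done before upload with the standard csv module, repeating the header row in each batch. A minimal sketch — `split_csv` is an illustrative helper; tune batch_size to your needs:

```python
import csv

def split_csv(path, batch_size=500):
    """Split a large CSV into smaller batch files, each with the header row.
    Returns the list of batch file paths."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)
        paths, batch, part = [], [], 1
        for row in reader:
            batch.append(row)
            if len(batch) == batch_size:
                paths.append(_write_batch(path, part, header, batch))
                batch, part = [], part + 1
        if batch:  # leftover rows form the final batch
            paths.append(_write_batch(path, part, header, batch))
    return paths

def _write_batch(path, part, header, rows):
    out = path.replace(".csv", f"_part{part}.csv")
    with open(out, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(header)
        writer.writerows(rows)
    return out
```

Each batch file can then be run through the upload-and-ingest steps above independently.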

Enrichment ICP

  • Focus on custom fields — Default fields (name, website, etc.) are populated automatically
  • Be specific — Clear field descriptions produce better results
  • Prioritize fields — List most important fields first

Error Handling

Common Issues

  • "Column not found" — The primary column doesn't exist. Check the column names in csv_metadata.
  • "Invalid CSV format" — Malformed CSV. Ensure proper UTF-8 encoding and formatting.
  • "Entity not found" — The company or person couldn't be matched. Check for typos in entity names.
  • Partial completion — Some rows failed enrichment. Review discarded items in the queue.

Handling Partial Failures

Some entities may be discarded if they can't be matched or enriched. Check the run queue for details:

curl -X GET "https://api.linkt.ai/v1/run/{run_id}/queue?state=discarded" \
  -H "x-api-key: your-api-key"
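Discarded items can be saved to a file for manual review and re-import. A sketch, assuming each queue item exposes name and reason fields — that shape is illustrative only, as the actual payload may differ:

```python
import csv

def export_discarded(items, path="discarded.csv"):
    """Save discarded queue items to a CSV for manual review.
    Assumes items carry "name" and "reason" fields -- an assumption about
    the queue payload; adjust to the actual shape."""
    discarded = [item for item in items if item.get("state") == "discarded"]
    with open(path, "w", newline="", encoding="utf-8") as f:
        # extrasaction="ignore" drops any fields beyond name/reason
        writer = csv.DictWriter(f, fieldnames=["name", "reason"], extrasaction="ignore")
        writer.writeheader()
        writer.writerows(discarded)
    return len(discarded)
```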

Next Steps