Using Inwire

This guide covers day-to-day workflows in Inwire — from registering models to running training jobs and setting up RAG pipelines. By the end, you'll understand how the major modules work together in typical ML workflows.


Overview of Common Workflows

Inwire supports several interconnected workflows:

┌─────────────────────────────────────────────────────────────────────────────┐
│                         Typical ML Workflow in Inwire                        │
└─────────────────────────────────────────────────────────────────────────────┘

   ┌─────────┐      ┌─────────┐      ┌─────────┐      ┌─────────┐
   │  Data   │  →   │ Model   │  →   │ Deploy  │  →   │ Monitor │
   │  Prep   │      │Training │      │         │      │         │
   └─────────┘      └─────────┘      └─────────┘      └─────────┘
        │                │                │                │
        ▼                ▼                ▼                ▼
   ┌─────────┐      ┌─────────┐      ┌─────────┐      ┌─────────┐
   │ Synthex │      │ Model   │      │ModelOps │      │ Observe │
   │         │      │Training │      │         │      │         │
   └─────────┘      └─────────┘      └─────────┘      └─────────┘

Working with Data in Synthex

Synthex is the "data twin" of Model Training — it manages all data-related operations including:

Importing a Dataset

  1. Navigate to SynthexDatasets
  2. Click Import Dataset
  3. Choose your source:

- Upload File — CSV, Parquet, JSONL

- Cloud Storage — S3, GCS, Azure Blob

- Database — Direct query (if configured)

  1. Configure import options:

| Option | Description |

|--------|-------------|

| Name | Descriptive dataset name |

| Description | What this data represents |

| Source Type | Tabular, Text, Time Series, etc. |

| Tags | For organization and filtering |

  1. Click Import

The system will automatically profile your data and detect the schema.

Creating a Data Profile

Data profiles define the schema and characteristics of your dataset:

  1. Go to SynthexData Profiles
  2. Click Create Profile
  3. Define columns:

```

Column: customer_id

Type: String

Constraints: Unique, Not Null

Column: transaction_amount

Type: Float

Constraints: Min: 0, Max: 100000

Column: is_fraud

Type: Boolean

Distribution: 1% True, 99% False

```

  1. Save the profile

Generating Synthetic Data

For detailed synthetic data generation, see the Synthex User Guide.

Quick generation workflow:

  1. Go to SynthexGenerate Data
  2. Select a data profile or source dataset
  3. Choose generation method:

- Statistical — Fast, preserves distributions

- GAN-based — Higher quality, slower

- LLM-based — For text and complex patterns

  1. Configure options (record count, privacy level)
  2. Click Generate

Running Training Jobs

Creating an Experiment

Experiments organize related training runs:

  1. Go to Model TrainingExperiments
  2. Click New Experiment
  3. Fill in details:

| Field | Example |

|-------|---------|

| Name | fraud-detector-2024 |

| Description | Binary classifier for transaction fraud |

| Tags | fraud, classification, production |

  1. Click Create

Starting a Training Job

Use the Training Wizard for guided setup:

Step 1: Select Model

Choose your model type and framework:

Step 2: Select Dataset

This is where Synthex integrates with Model Training:

  1. Click Select Dataset
  2. Browse available datasets:

- Raw Datasets — Original uploaded data

- Cleaned Datasets — Processed versions

- Synthetic/Augmented — Generated data

  1. Select dataset version
  2. Optionally select a Data Recipe for transformations

> Note: The UI queries Synthex to show available datasets. Your selection is recorded for reproducibility.

Step 3: Configure Training

Set hyperparameters and training options:

# Example configuration
learning_rate: 0.001
batch_size: 32
epochs: 100
optimizer: adam
early_stopping:
  patience: 10
  metric: val_loss

Step 4: Select Infrastructure

Choose compute resources:

Step 5: Review and Launch

  1. Review all settings
  2. Click Start Training
  3. Monitor progress in real-time

Monitoring Training Progress

The training dashboard shows:

┌────────────────────────────────────────────────────────────┐
│  Training: fraud-detector-run-42                           │
├────────────────────────────────────────────────────────────┤
│  Status: Running (Epoch 45/100)                            │
│                                                            │
│  ┌──────────────────────────────┐  Metrics                 │
│  │ Loss                          │  ────────               │
│  │  ▁▂▃▄▅▄▃▂▁▁▁                 │  Train Loss: 0.023      │
│  │                              │  Val Loss: 0.031        │
│  └──────────────────────────────┘  Accuracy: 98.2%        │
│                                                            │
│  GPU: 78%   Memory: 12.4/16 GB   ETA: 23 min              │
└────────────────────────────────────────────────────────────┘

Comparing Experiments

To compare multiple runs:

  1. Go to Model TrainingExperiments
  2. Select experiments to compare (checkbox)
  3. Click Compare
  4. View side-by-side:

- Configuration differences

- Metric comparisons

- Training curves


Deploying Models with ModelOps

Registering a Model

After training completes:

  1. Go to Model TrainingExperiments[Your Experiment]
  2. Select the best run
  3. Click Register Model
  4. Fill in model details:

| Field | Example |

|-------|---------|

| Name | fraud-detector |

| Version | 1.0.0 |

| Description | Production fraud detection model |

| Tags | production, fraud, v1 |

  1. Click Register

The model is now in the Model Registry.

Creating a Deployment

  1. Go to ModelOpsDeployments
  2. Click New Deployment
  3. Configure:

| Setting | Description |

|---------|-------------|

| Model | Select from registry |

| Version | Model version to deploy |

| Environment | dev, staging, or prod |

| Replicas | Number of instances |

| Resources | CPU/Memory/GPU allocation |

  1. Click Deploy

Deployment Strategies

Inwire supports several deployment strategies:

Strategy Description Use Case
Rolling Gradual replacement Low-risk updates
Blue/Green Instant switch Zero-downtime releases
Canary Partial traffic shift Testing in production
Shadow Mirror traffic Validation without impact

Monitoring Deployments

The deployment dashboard shows:


Building RAG Pipelines

Understanding RAG in Inwire

RAG (Retrieval-Augmented Generation) pipelines combine:

  1. Document retrieval from a knowledge base
  2. LLM generation using retrieved context

Creating a RAG Pipeline

  1. Go to RAGPipelines
  2. Click Create Pipeline
  3. Configure stages:

Stage 1: Data Sources

Add documents to index:

Stage 2: Processing

Configure document processing:

Stage 3: Retrieval

Set up search configuration:

Stage 4: Generation

Configure LLM generation:

Testing RAG Pipelines

  1. Go to your pipeline
  2. Click Test
  3. Enter a query
  4. Review:

- Retrieved documents

- Generated response

- Confidence scores

RAG and Synthex Integration

Synthex can generate test data for RAG pipelines:

See the Synthex User Guide for details on generating RAG evaluation data.


Working with Real-time Data (Stream)

Understanding Stream

The Stream service handles real-time data:

Creating a Stream Pipeline

  1. Go to StreamPipelines
  2. Click Create Pipeline
  3. Configure:

- Source — Kafka, webhook, database CDC

- Transformations — Filter, map, aggregate

- Sink — Storage, model inference, alerts

Monitoring Streams

View real-time metrics:


Prompt Engineering with PromptScope

Testing Prompts

  1. Go to PromptScopePlayground
  2. Enter your prompt
  3. Select model and parameters
  4. Run and iterate

Managing Prompt Templates

  1. Go to PromptScopeTemplates
  2. Create reusable prompt templates
  3. Version and compare templates
  4. Deploy templates to production

A/B Testing Prompts

  1. Create variant prompts
  2. Set up an A/B test
  3. Route traffic to variants
  4. Analyze results

Reproducibility and Lineage

Understanding Data Lineage

Inwire tracks the complete lineage of your ML artifacts:

Raw Dataset → Cleaned Dataset → Synthetic Dataset → Training Run → Model → Deployment
     ↓              ↓                  ↓                 ↓          ↓          ↓
[Synthex]      [Recipe]         [Synthex Config]    [Experiment]  [Registry] [ModelOps]

Viewing Lineage

  1. Go to any artifact (dataset, model, deployment)
  2. Click Lineage or History
  3. View the complete chain of operations

Reproducing Results

To reproduce a training run:

  1. Go to the experiment run
  2. View Configuration — All parameters used
  3. View Data — Exact dataset version and recipe
  4. Click Reproduce to create a new run with identical settings

Best Practices

Data Management

Training Workflows

Deployment Safety


Common Workflows Summary

Workflow Services Used Key Steps
Train a classifier Synthex → Model Training Import data → Create profile → Train → Evaluate
Deploy a model Model Training → ModelOps Register model → Create deployment → Monitor
Build RAG pipeline RAG + Synthex Add documents → Configure retrieval → Test with synthetic queries
Generate test data Synthex Create profile → Generate synthetic → Export

Next Steps

For deeper dives into specific services:

For help with specific tasks, use the in-app help or return to the User Guide.