# LMNT

LMNT focuses on voice and audio manipulation, possibly leveraging AI to generate or transform sound for various creative and technical use cases

- **Category:** artificial intelligence
- **Auth:** API_KEY
- **Composio Managed App Available?** N/A
- **Tools:** 8
- **Triggers:** 0
- **Slug:** `LMNT`
- **Version:** 20260316_00

## Tools

### Create Voice

**Slug:** `LMNT_CREATE_VOICE`

Creates a custom voice in LMNT by training on uploaded audio samples. The voice can then be used for text-to-speech synthesis. Returns the voice ID and metadata upon successful creation. The voice may be in 'training' state initially before becoming 'ready'.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `name` | string | Yes | The display name for this voice. |
| `type` | string ("instant" | "professional") | No | The type of voice to create. Use 'instant' for quick voice cloning or 'professional' for higher quality. Defaults to 'instant'. |
| `files` | object | Yes | Audio file to train the voice. Supported formats: WAV, MP3, MP4, M4A, WebM. Maximum file size: 250 MB. |
| `gender` | string | No | Optional gender tag for this voice (e.g., 'male', 'female', 'nonbinary'). This is metadata only and does not affect voice generation. |
| `enhance` | boolean | No | Whether to apply audio processing to reduce background noise. Set to true for unclean audio, but note this may degrade quality in some cases. Defaults to false. |
| `description` | string | No | Optional text description of this voice for organizational purposes. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Delete Voice Info

**Slug:** `LMNT_DELETE_VOICE_INFO`

Deletes a voice from your LMNT account. This operation permanently removes the voice and cancels any pending operations on it. This action cannot be undone. Only voices owned by you (owner='me') can be deleted; system voices cannot be deleted. Use case: Remove custom voices that are no longer needed to manage your voice library.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | The unique identifier of the voice to delete. This must be a voice owned by you (voices with owner='me'). System voices cannot be deleted. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Generate Speech With Metadata

**Slug:** `LMNT_GENERATE_SPEECH_WITH_METADATA`

Generates speech from text and returns JSON with base64-encoded audio and optional word-level timing metadata. Use when you need the synthesis seed or word timestamps for subtitle synchronization. For lower latency without metadata, use the Synthesize Speech action instead.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `seed` | integer | No | Integer seed for reproducible speech variations. Use the same seed to replicate a specific output. |
| `text` | string | Yes | The text to synthesize into speech (max 5000 characters including spaces). |
| `debug` | boolean | No | When true, saves the synthesis clip to your clip library for debugging purposes. |
| `model` | string | No | The synthesis model to use (default: 'blizzard'). |
| `top_p` | number | No | Controls speech stability (0-1 range, default: 0.8). Lower values produce more consistent speech. |
| `voice` | string | Yes | The voice ID to use for speech synthesis (e.g., 'lily', 'leah', 'daniel'). Use the List Voices action to get available voice IDs. |
| `format` | string | No | Output audio format. Streamable formats (generate faster): mp3 (default), ulaw, webm, pcm_s16le, pcm_f32le. Non-streamable: aac, wav. |
| `language` | string | No | ISO 639-1 two-letter language code (e.g., 'en', 'es', 'fr'). Auto-detected by default, but specifying language improves generation speed. |
| `sample_rate` | integer | No | Audio sample rate in Hz. Options: 8000, 16000, or 24000. |
| `temperature` | number | No | Controls speech expressiveness (numeric value, default: 1.0). Higher values increase variation. |
| `return_durations` | boolean | No | When true, returns word-level duration timestamps in the response. Useful for synchronizing subtitles or animations with speech. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get Account

**Slug:** `LMNT_GET_ACCOUNT`

Retrieves account information including subscription plan details and current usage statistics.

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get Voice Info

**Slug:** `LMNT_GET_VOICE_INFO`

Gets metadata for a specific LMNT voice, including active status, supported languages, and plan availability. Useful for validating a voice ID before using it in synthesis requests.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | The ID of the voice to retrieve. Can be obtained from the list voices endpoint. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Get Voices List

**Slug:** `LMNT_GET_VOICES_LIST`

Retrieves a list of available voices from LMNT. Returns both system-provided preset voices and any custom voices you have created. Use filters to narrow results by ownership (system vs custom) or starred status. Each voice includes details like ID, name, description, gender, state, and preview URL.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `owner` | string ("all" | "system" | "me") | No | Filter voices by owner. Options: 'all' (default, shows all voices), 'system' (only LMNT preset voices), 'me' (only your custom voices). |
| `starred` | boolean | No | Filter to show only starred voices. Set to true to return only voices you have starred, false (default) to show all voices. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Synthesize Speech

**Slug:** `LMNT_SYNTHESIZE_SPEECH`

Synthesizes speech from text using LMNT's AI voices. Converts text (up to 5000 characters) into natural-sounding speech audio using a specified voice. Returns base64-encoded audio at `data.response_data.audio` — decode before saving or passing to other tools. Supports multiple audio formats and quality settings for different use cases.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `seed` | integer | No | Integer seed for reproducible speech variations. Use the same seed to replicate a specific output. |
| `text` | string | Yes | The text to synthesize into speech (max 5000 characters including spaces). For texts exceeding 5000 characters, split into chunks and call separately, keeping `voice`, `model`, `format`, and `sample_rate` identical across all chunks to avoid audible seams. |
| `debug` | boolean | No | When true, saves the synthesis clip to your clip library for debugging purposes. |
| `model` | string | No | The synthesis model to use (default: 'blizzard'). |
| `top_p` | number | No | Controls speech stability (0-1 range, default: 0.8). Lower values produce more consistent speech. |
| `voice` | string | Yes | The voice ID to use for speech synthesis (e.g., 'lily', 'leah', 'daniel'). Use the List Voices action to get available voice IDs. |
| `format` | string | No | Output audio format. Streamable formats (generate faster): mp3 (default), ulaw, webm, pcm_s16le, pcm_f32le. Non-streamable: aac, wav. |
| `language` | string | No | ISO 639-1 two-letter language code (e.g., 'en', 'es', 'fr'). Auto-detected by default, but specifying language improves generation speed. |
| `sample_rate` | integer | No | Audio sample rate in Hz. Options: 8000, 16000, or 24000 (default). |
| `temperature` | number | No | Controls speech expressiveness (numeric value, default: 1.0). Higher values increase variation. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |

### Update Voice Info

**Slug:** `LMNT_UPDATE_VOICE`

Updates information about a specific voice in LMNT. You can update the name, description, gender, starred status, and unfreeze state of a voice. Note: Only user-owned voices (owner='me') can have their name, description, and gender updated. System voices can only be starred/unstarred.

#### Input Parameters

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `id` | string | Yes | The unique identifier of the voice to update |
| `name` | string | No | The display name for this voice. |
| `gender` | string | No | A tag describing the gender of this voice, e.g. male, female, nonbinary. |
| `starred` | boolean | No | If true, adds this voice to your starred list. If false, removes it from your starred list. |
| `unfreeze` | boolean | No | If true, unfreezes this voice and upgrades it to the latest model. |
| `description` | string | No | A description of this voice. |

#### Output

| Parameter | Type | Required | Description |
|-----------|------|----------|-------------|
| `data` | string | Yes | Data from the action execution |
| `error` | string | No | Error if any occurred during the execution of the action |
| `successful` | boolean | Yes | Whether or not the action execution was successful or not |
