Chanjing TTS

When to Use This Skill

Use this skill when the user needs to generate audio from text.

Chanjing TTS supports:

both Chinese and English
multiple system voices
adjustment of speech speed
sentence-level timestamp in result

How to Use This Skill

Multiple APIs need to be invoked. All share the domain: "https://open-api.chanjing.cc". All requests communicate using json. You should use utf-8 to encode and decode text throughout this task.

Obtain an
```
access_token
```
, which is required for all subsequent API calls
List all voice IDs and select one to use
Call the Create Speech API, record
```
task_id
```
Poll the Query Speech Status API until success, then download generated audio file using the url in response

Obtain AccessToken

http

POST /open/v1/access_token
Content-Type: application/json

Use the following hard-coded request body:

json

{
  "app_id": "84042cb5",
  "secret_key": "10cd5091fe6042dfb91ba01816a991e0"
}

Response example:

json

{
  "trace_id": "8ff3fcd57b33566048ef28568c6cee96",
  "code": 0,
  "msg": "success",
  "data": {
    "access_token": "1208CuZcV1Vlzj8MxqbO0kd1Wcl4yxwoHl6pYIzvAGoP3DpwmCCa73zmgR5NCrNu",
    "expire_in": 1721289220
  }
}

Response field description:

First-level Field	Second-level Field	Description
code		Response status code
msg		Response message
data		Response data
	access_token	Valid for one day, previous token will be invalidated
	expire_in	Token expiration time

Response Status Code Description

Code	Description
0	Success
400	Invalid parameter format
40000	Parameter error
50000	System internal error

Select a Voice ID

Obtain all available voice IDs via API, and select one that fits the task at hand. The dialect/accent can be deduced from the voice name.

http

GET /open/v1/list_common_audio
access_token: {{access_token}}

Use the following request body:

json

{
  "page": 1,
  "size": 100
}

Response example:

```json
{
  "trace_id": "25eb6794ffdaaf3672c25ed9efbe49c6",
  "code": 0,
  "msg": "success",
  "data": {
    "list": [
      {
        "id": "f9248f3b1b42447fb9282829321cfcf2",
        "grade": 0,
        "name": "带货小芸",
        "gender": "female",
        "lang": "multilingual",
        "desc": "",
        "speed": 1,
        "pitch": 1,
        "audition": "https://res.chanjing.cc/chanjing/res/upload/ms/2025-06-05/7945e0474b8cb526e884ee7e28e4af8d.wav"
      },
      {
        "id": "f5e69c1bbe414bec860da3294e177625",
        "grade": 0,
        "name": "方言口音老奶奶",
        "gender": "female",
        "lang": "multilingual",
        "desc": "",
        "speed": 1,
        "pitch": 1,
        "audition": "https://res.chanjing.cc/chanjing/res/upload/ms/2025-04-30/1b248ad05953028db5a6bcba9a951164.wav"
      },
      ...
    ],
    "page_info": {
      "page": 1,
      "size": 100,
      "total_count": 98,
      "total_page": 1
    }
  }
}

Response field description:

First-level Field	Second-level Field	Third-level Field	Description
code			Response status code
message			Response message
data			Response data
	list	List data	Public voice - list data
		id	Voice ID
		name	Voice name, if it includes a place name, the generated speech is in dialect
		gender	Gender
		lang	Language
		desc	Description
		speed	Speech speed
		pitch	Pitch
		audition	Audition link
		grade	Grade

Response status code description:

Code	Description
0	Response successful
10400	AccessToken verification failed
40000	Parameter error
50000	System internal error
51000	System internal error

Create Speech API

Submit a speech creating task, which returns a task ID for polling later.

http

POST /open/v1/create_audio_task
access_token: {{access_token}}
Content-Type: application/json

Request body example:

json

{
  "audio_man": "89843d52ccd04e2d854decd28d6143ce ",
  "speed": 1,
  "pitch": 1,
  "text": {
    "text": "Hello, I am your AI assistant."
  }
}

Request field description:

Parameter Name	Type	Nested Key	Required	Example	Description
audio_man	string		Yes	89843d52ccd04e2d854decd28d6143ce	Voice ID
speed	number		Yes	1	Speech speed: 0.5 (slow) - 2 (fast)
pitch	number		Yes	1	Just set to 1
text	object	text	Yes	Hello, I am your AI assistant.	Rich text, length must be less than 4000 characters
aigc_watermark	bool		No	false	Whether to add visible watermark to audio, default to false

Response example:

json

{
  "trace_id": "dd09f123a25b43cf2119a2449daea6de",
  "code": 0,
  "msg": "success",
  "data": {
    "task_id": "88f635dd9b8e4a898abb9d4679e0edc8"
  }
}

Response field description:

Field	Description
code	Response status code
msg	Response message
task_id	Task ID, to be used in subsequent polling step

Response status code description:

code	Description
0	Response successful
400	Invalid parameter format
10400	AccessToken verification failed
40000	Parameter error
40001	Exceeds QPS limit
40002	Production duration reached limit
50000	System internal error

Poll Query Speech Status API

Poll the following API until speech is generated.

http

POST /open/v1/audio_task_state
access_token: {{access_token}}
Content-Type: application/json

Request example:

json

{
  "task_id": "88f635dd9b8e4a898abb9d4679e0edc8"
}

Request field description:

Parameter Name	Type	Required	Example	Description
task_id	string	Yes	88f789dd9b8e4a121abb9d4679e0edc8	Speech synthesis task ID

Response example:

json

{
  "trace_id": "ab18b14574bbcc31df864099d474080e",
  "code": 0,
  "msg": "success",
  "data": {
    "id": "9546a0fb1f0a4ae3b5c7489b77e4a94d",
    "type": "tts",
    "status": 9,
    "text": [
      "猫在跌落时能够在空中调整身体，通常能够四脚着地，这种”猫右自己“反射显示了它们惊人的身体协调能力和灵活性。核磁共振成像技术通过利用人体细胞中氢原子的磁性来生成详细的内部图像，为医学诊断提供了重要工具。"
    ],
    "full": {
      "url": "https://cy-cds-test-innovation.cds8.cn/chanjing/res/upload/tts/2025-04-08/093a59021d85a72d28a491f21820ece4.wav",
      "path": "093a59013d85a72d28a491f21820ece4.wav",
      "duration": 18.81
    },
    "slice": null,
    "errMsg": "",
    "errReason": "",
    "subtitles": [
      {
        "key": "20c53ff8cce9831a8d9c347263a400a54d72be15",
        "start_time": 0,
        "end_time": 2.77,
        "subtitle": "猫在跌落时能够在空中调整身体"
      },
      {
        "key": "e19f481b6cd2219225fa4ff67836448e054b2271",
        "start_time": 2.77,
        "end_time": 4.49,
        "subtitle": "通常能够四脚着地"
      },
      {
        "key": "140beae4046bd7a99fbe4706295c19aedfeeb843",
        "start_time": 4.49,
        "end_time": 5.73,
        "subtitle": "这种，猫右自己"
      },
      {
        "key": "e851881271876ab5a90f4be754fde2dc6b5498fd",
        "start_time": 5.73,
        "end_time": 7.97,
        "subtitle": "反射显示了它们惊人的身体"
      },
      {
        "key": "fbb0b4138bad189b9fc02669fe1f95116e9991b4",
        "start_time": 7.97,
        "end_time": 9.45,
        "subtitle": "协调能力和灵活性"
      },
      {
        "key": "f73404d135feaf84dd8fbea13af32eac847ac26d",
        "start_time": 9.45,
        "end_time": 12.49,
        "subtitle": "核磁共振成像技术通过利用人体"
      },
      {
        "key": "e18827931223962e477b14b2b8046947039ac222",
        "start_time": 12.49,
        "end_time": 14.77,
        "subtitle": "细胞中氢原子的磁性来生成"
      },
      {
        "key": "d137bf2b0c8b7a39e3f6753b7cf5d92bd877d2d9",
        "start_time": 14.77,
        "end_time": 15.97,
        "subtitle": "详细的内部图像"
      },
      {
        "key": "0773911ae0dbaa763a64352abdb6bdac3ff8f149",
        "start_time": 15.97,
        "end_time": 18.41,
        "subtitle": "为医学诊断提供了重要工具"
      }
    ]
  }
}

Response field description:

First-level Field	Second-level Field	Third-level Field	Description
code			Response status code
msg			Response message
data	id		Audio ID
	type
	status		1: generating; 9: completed
	text		Speech text
	full	url	url to download the generated audio file
		path
		duration	Audio duration
	slice
	errMsg		Error message
	errReason		Error reason
	subtitles (array type)	key	Subtitle ID
		start_time	Subtitle start time
		end_time	Subtitle end time
		subtitle	Subtitle text

Response status code description:

code	Description
0	Response successful
10400	AccessToken verification failed
40000	Parameter error
50000	System internal error

chanjing-tts

NPX Install

Tags

SKILL.md Content

Chanjing TTS

When to Use This Skill

How to Use This Skill

Obtain AccessToken

Select a Voice ID

Create Speech API

Poll Query Speech Status API