« 2025/05 »
일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Tags more

Archives

Today

Total

« 2025/05 »
일	월	화	수	목	금	토
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30	31

Tags more

Archives

Today

Total

관리 메뉴

반업주부의 일상 배움사

AI로 모든 음성을 복제하는 방법 | Tortoise-TTS 튜토리얼 :: ChatGPT 정리 본문

IT 인터넷/일반

AI로 모든 음성을 복제하는 방법 | Tortoise-TTS 튜토리얼 :: ChatGPT 정리

Banjubu 2023. 3. 19. 23:04

> English Summary

> English Full Text

[ 요약 ]

멋진 무료 AI 도구를 사용하여 본인의 목소리나 다른 사람의 목소리를 복제하는 방법을 보여드릴 것입니다.
이 도구를 책임 있게 사용하십시오.
당신이 복제하고자 하는 목소리의 샘플만 제공하면, 이 도구가 나머지를 처리합니다.
최고의 교육 데이터를 얻는 방법을 포함하여 단계별 과정을 안내해 드리겠습니다.

Turtle Text-To-Speech는 GitHub에서 무료로 사용할 수 있는 오픈소스 도구입니다.
이것은 자신의 컴퓨터에서 실행할 수 있습니다.
저장소에 설치 지침이 제공되지만, 우리는 Google Collaboration을 사용하여 코드를 클라우드에서 실행할 것입니다.
패키지의 저자가 Google Collaboration을 처음에 제거했지만 인터넷의 다른 좋은 사람들을 위해 사용할 수 있도록 만들었으므로 사용할 것입니다.
그러나 최상의 결과를 원하는 경우에는 설치하여 로컬로 실행하는 것을 권장합니다.
새로운 목소리를 추가하고 복제하는 방법은 이제 매우 간단합니다.

먼저 복제하려는 화자의 오디오 클립을 수집해야 합니다.
이 경우 나의 목소리를 사용하고 음성 세그먼트를 녹음할 수 있는 소프트웨어를 보여드리겠습니다.
오디오 클립은 10초 세그먼트여야 하며, 최소한 3개 세그먼트를 요구하지만, 더 많은 데이터 포인트를 포함할수록 품질이 더욱 개선됩니다.
저장해야 할 오디오 파일은 웹 파일로 저장해야 하며, 22 킬로헤르쯔 샘플링 비율이어야 합니다.
대부분의 오디오 시스템은 44 킬로헤르쯔로 녹음하므로, 22 킬로헤르쯔로 수정해야 합니다.
로컬로 실행하는 경우 "speech" 하위 디렉터리를 만들고 여기에 데이터를 넣어야 합니다.
다른 클라우드를 사용하는 경우, 마지막 세 가지 단계는 약간 다를 수 있습니다.
오디오 녹음을 위해 interactivity라는 무료 도구를 사용하고 있습니다.
이 비디오 설명에 링크를 넣겠습니다.

이것은 매우 강력한 오디오 처리 도구이지만, 지금은 기본 기능만 사용할 것입니다.
오디오를 녹음하려면 먼저 마이크를 선택해야 합니다.
여기에 "Where"를 클릭한 다음 사용할 마이크를 선택해야 합니다.
나는 이 마이크를 선택할 것입니다.
22 킬로헤르쯔로 설정하십시오.
오디오 세그먼트를 녹음하기 위해 샘플링 비율을 22 킬로헤르쯔로 설정하면 녹음 버튼을 클릭하기만 하면 되고, 녹음이 시작됩니다.
이 경우, 녹음 중입니다.
몇 초 동안 녹음했는지 확인할 수 있으며, 약 10초까지 촬영할 예정입니다.
그런 다음 녹음을 멈추기만 하면 됩니다.
그리고 파일에서 내보내기를 클릭하고 웹 파일로 내보내기를 원합니다.
디렉토리를 "audio"로 저장하고 밑줄 스타일의 세그먼트로 이동하면 약 5개 세그먼트가 필요하다는 것을 알 수 있지만, 나의 경우에는 약 10개 세그먼트로 이동하여 녹음한 것을 들을 수 있습니다.

이제 재생 버튼을 클릭하기만 하면 나의 목소리가 녹음됩니다.
이 경우 나는 10개의 다른 음성 세그먼트를 녹음했습니다.
녹음 중인데, 음질이 좋은지 확인하려고 하기 때문입니다.이제 몇 가지 지침을 따르는 방법을 알아보겠습니다.
오디오 클립을 녹음하기 전에 입력에 따라 복제 클립의 품질이 달라질 수 있으므로 주의해야 합니다.
배경 음악 노이즈나 반향 등을 포함한 클립은 피해야 할 것입니다.
또한 전화 통화 클립이나 "오"와 같은 말이 끼어 있는 클립은 지양해야 합니다.
샘플링 데이터 세트에서 제거된 것과 유사한 말이죠.
이러한 요소를 제거해야 원하는 목소리와 유사한 결과물이 나옵니다.
또한, 오디오북을 읽는 목소리를 듣고 싶다면 클립 내용은 중요하지 않습니다.
모델이 학습할 수 있는 다양한 텍스트를 제공해야 합니다.

Notebook을 실행할 준비가 되었습니다.
이 노트북을 복사하여 저장하는 것이 좋습니다.
라이브러리 몇 개를 설치해야 하므로, 먼저 런타임을 GPU로 선택한 후, 연결 버튼을 클릭합니다.
실행할 코드가 적힌 첫 번째 셀을 실행합니다.
이 셀이 실행되면 필요한 라이브러리를 설치합니다.
다음 셀을 실행하면 녹음한 파일을 선택할 수 있습니다.
업로드한 파일은 모두 선택할 수 있습니다.
다음으로, 클론 사운드가 말해야 하는 텍스트를 정의해야 합니다.
프리셋 옵션을 사용하여 빠르게 설정할 수 있습니다.
디폴트 옵션은 빠르지만, 표준은 높은 품질에 더 많은 시간이 소요됩니다.
마지막 셀을 실행하여 오디오를 생성합니다.
결과물을 확인하면, 라이크와 코멘트를 부탁드립니다.
클론 사운드가 나와 완전히 같지는 않지만, 유사함이 있습니다.
이러한 옵션 기능을 사용하여 목표에 빠르게 접근할 수 있습니다.

이 도구들을 실사용하기 전에 책임 있게 확인하여 사용하십시오.이 블록을 다시 실행한 후 최종 셀을 실행해주세요.저는 두 가지 다른 설정으로 실행했습니다.첫 번째는 Fast이고 두 번째는 Super Fast입니다.이것이 출력 음향의 품질에 어떤 영향을 미치는지 확인해보겠습니다.
샘플은 10개 정도만 제공했기 때문에 품질에 문제가 있을 수 있습니다.
또한 제가 녹음한 오디오는 실제로 좋은 품질의 오디오가 아닙니다.
제 방에서 녹음했기 때문입니다.제공한 오디오는 빠른 샘플일 뿐입니다.
이제 그것이 좋은 품질의 샘플인지 확인할 수 있습니다.이 라이브러리의 가장 좋은 기능은 실제로 음성의 톤을 제어할 수 있다는 것입니다.그래서 만약 우울하고 현명한 음성을 원한다면, 더 행복한 음성이나 누군가 열정적인 음성으로 같은 것을 말하는 것도 가능합니다.

How to Clone Any Voice With AI | Tortoise-TTS Tutorial
https://www.youtube.com/watch?v=uQ9PYhAsd6w

[ 한글 전체 ]

이 비디오에서는 이 멋진 무료 AI 도구를 사용하여 자신의 목소리 또는 다른 사람의 목소리를 복제하는 방법을 보여드리겠습니다.
이 도구를 책임감 있게 사용하기 위해 명심하십시오.
복제하려는 음성의 샘플을 제공하기만 하면 나머지는 이 도구가 자동으로 수행합니다.
가능한 최상의 훈련 데이터를 얻는 방법을 포함하여 단계별 프로세스를 안내합니다.
모든 팁과 요령을 보려면 비디오를 끝까지 시청하십시오.
이 비디오에서 시작하겠습니다.거북이 텍스트 음성 변환은 오픈소스 도구로, GitHub에서 무료로 사용할 수 있습니다.
자신의 컴퓨터에서 실행할 수 있습니다.
설치 지침은 저장소에 제공되지만, 클라우드에서 코드를 실행하기 위해 Google 콜라보레이션을 사용할 것입니다.
패키지 작성자는 원래 Google 콜라보레이션을 제거했지만 인터넷에 다른 좋은 사람들이 그것을 사용할 수 있도록 만들었으므로, 우리는 그것을 사용할 것이지만 최상의 결과를 원한다면 로컬에 설치하고 실행하는 것이 좋습니다.지침은 이제 새 음성을 추가하는 방법과 복제하는 방법을 따르는 것이 매우 간단하므로, 여기에 필요한 모든 지침이 있으므로, 먼저 음성을 복제하려는 스피커의 오디오 클립을 수집해야합니다.
제 경우에는 제 목소리를 사용하고 음성 세그먼트를 녹음할 수 있는 소프트웨어를 보여드리겠습니다.
그러면 오디오 클립은 10초 세그먼트여야 하며, 최소 3 세그먼트가 필요하지만, 더 나은 품질을 원한다면 항상 더 많은 데이터 포인트를 포함하는 것이 좋습니다.오디오 파일을 웹 파일로 저장하는 데 필요한 오디오 파일은 22 킬로헤르츠 샘플링 속도에 있어야 합니다.
대부분의 오디오 시스템은 44 킬로헤르츠를 기록하지만, 22 킬로헤르츠로 수정해야 하며 로컬에서 실행하는 경우 "음성"이라는 하위 디렉토리를 만들고 거기에 데이터를 넣어야 합니다.
다른 클라우드를 사용하는 경우 마지막 세 단계는 이제 조금 달라질 것입니다.
우리가 일부 데이터를 수집해야 하는 방법에서 벗어난 지침이므로, 오디오 녹음을 위해 저는 대화성이라는 무료 도구를 사용하고 있습니다.
비디오 설명에 링크를 넣을 것입니다.
그것은 매우 강력한 오디오 처리 도구이지만 우리는 지금 기본 기능만 사용할 것입니다.
오디오를 녹음하려면 먼저 마이크를 선택해야 하므로 여기로 와서 "어디"를 클릭한 다음 사용하려는 마이크를 선택하면 됩니다.
이 톤 마이크가 있으므로 지침에서 오디오를 기억한다면 이것을 선택하겠습니다.
22 킬로헤르츠에 맞춰 설정해주세요.오디오 세그먼트를 녹음하기 위해 샘플링 속도를 22 킬로헤르츠로 설정하고 녹음 버튼을 클릭하기만 하면 음성을 녹음하기 시작합니다.
이 경우에는 녹음 중이며, 몇 초 동안 녹음한 데이터를 볼 수 있으므로 약 10초까지 올라갔다고 가정합니다.
그리고 녹음 중지를 클릭하기만 하면 내보내기 위해 파일로 이동한 다음, 웹 파일로 내보내고 싶으므로 여기 있습니다.
디렉토리를 "오디오"로 저장하고, 세그먼트 밑줄 하나로 이동하면 약 5개의 세그먼트가 필요하지만, 제 경우에는 약 10개의 세그먼트로 이동하여 녹음한 내용을 들을 수도 있습니다.
이제 재생 버튼을 클릭하기만 하면 음성 녹음이 시작되므로, 이 경우에는 녹음 중이므로 듣고, 오디오 품질이 좋은지 확인하고 싶기 때문에 약 10개의 다른 음성 세그먼트를 녹음했습니다.
이제 따라야 할 몇 가지 지침을 살펴보겠습니다.
오디오 클립을 녹음하기 전에, 클론 클립의 품질은 입력에 따라 달라집니다.
따라야 할 지침을 제공했으므로, 예를 들어 배경 음악 소음이나 리버브가 있는 클립을 피하십시오.
이러한 클립은 훈련 데이터 세트에서 제거된 단어 연설과 유사하게 일반적으로 증폭 시스템으로 인한 왜곡이 있습니다.또한 전화 통화에서 클립을 피하고, 과도한 말더듬이나 "오"와 같은 단어가 있는 클립을 피하고 싶다고 말하고 있습니다.
오른쪽과 같은 데이터를 입력으로 제공할 때, 이러한 데이터를 피하고 싶지 않으므로 출력 소리가 어떻게 들릴지 염두에 두고 싶습니다.
예를 들어, 오디오 북을 읽는 대상 음성을 듣고 싶다면 클립에서 말하는 내용은 중요하지 않습니다.
모델이 배울 수 있도록 다양한 텍스트를 원합니다.
여기에 우리가 실행할 노트북이 있습니다.
가서 복사본을 만드는 것이 좋습니다.
사본을 만들고 사본을 실행할 것이므로 노트북 작성자와 데이터를 공유하고 싶지 않으므로 첫 번째 단계는 런타임으로 이동하여 런타임이 GPU인지 확인하고, 다른 옵션이 있지만 GPU를 선택하고 저장한 다음 연결을 클릭하는 것입니다.
매우 간단한 프로세스가 될 것입니다.
몇 개의 셀을 실행하기만 하면 어떤 코드가 실행 중인지 알 필요가 없으므로, 연결되면 실제로 이 시스템의 램, GPU 및 디스크 사용량을 볼 수 있으므로 먼저 여기에 와서 클릭하십시오.첫 번째 셀의 디스플레이 버튼은 코드를 실행하고 모델 실행에 필요한 일부 라이브러리를 설치합니다.
셀 실행이 완료되면 완료될 때까지 기다리면 이 녹색 표시가 표시되므로 아래로 내려가서 두 번째 셀을 실행하겠습니다.
재생 버튼을 클릭하기만 하면 이 셀이 큰 파일을 다운로드하여 진행 상황을 볼 수 있습니다.
좋아요, 약 4분이 걸렸습니다.
다음 셀을 실행하겠습니다.
이 셀을 실행하면 실제로 이 셀을 실행할 때 녹음한 파일을 선택하도록 요청할 것입니다.
우리가 녹음한 파일, 모든 파일을 업로드하겠습니다.
그래서 당신이 해야 할 일은 모든 파일을 선택하고 열기를 클릭하기만 하면 파일이 하나씩 업로드되기 시작합니다.
좋아, 모든 파일이 다음 블록에 업로드됩니다.
클론 사운드가 말해야 하는 텍스트를 정의하고 싶어서 내 채널에 플러그를 꽂았으므로 이 비디오가 유용하다고 생각되면 좋아요를 눌러 주시고 댓글 달기를 잊지 마세요.
사전 설정 옵션이 있으므로 빠르게 설정할 수 있는 것이 기본값이며, 꽤 빠르게 처리한 다음 표준이 있습니다.고품질이 걸리고 처리하는 데 더 많은 시간이 걸리므로, 이 자체를 실행하고 마지막 셀을 실행할 것이므로, 여기에서 오디오를 생성하고 결과가 준비되도록 결과가 어떻게 나오는지 봅시다.
이 비디오가 유용하다고 생각되면 엄지 손가락을 치켜들고 댓글을 달는 것을 잊지 마세요.
정확히 저처럼 들리지는 않지만 닮은 점이 있다고 생각합니다.
그리고 우리는 실제로 사전 설정으로 계획할 수 있으므로, 고품질 대신 고속으로 갈 수 있습니다.
이 비디오가 유용하다고 생각되면 좋아요를 누르고 댓글을 남기는 것을 잊지 마세요.
좋아요!
이 블록을 다시 실행한 다음 최종 셀을 실행하겠습니다.
그래서 저는 두 가지 설정으로 실행했습니다.
첫 번째는 Fast이고 다른 하나는 초고속입니다.
그래서 그것이 출력 사운드의 품질에 어떤 영향을 미치는지 봅시다.
이 비디오가 유용하다고 생각되면 좋아요를 누르고 댓글을 남기고, 초고속으로 봅시다.
제발 부탁합니다.
이 비디오가 유용하다고 생각되면 엄지 손가락을 치켜들고 댓글을 달는 것을 잊지 마세요.
확실히 품질 문제가 있다고 생각하고 약간의 유사점이 있을 수 있지만, 약 10개의 샘플만 제공했으며, 녹음한 오디오는 방에서 녹음했기 때문에 실제로 좋은 품질의 오디오가 아닙니다.
따라서 제공하는 오디오는 간단한 샘플일 뿐이니, 이제 좋은 품질의 샘플인지 확인해보세요.
이 라이브러리의 가장 큰 특징은 실제로 음성의 톤을 제어할 수 있다는 것입니다.
정말 슬프고, 괄호 안에 괄호 안에 넣어야 하는 것과 같은 현명한 세트를 갖고 싶다면, 같은 말을하지만 더 행복한 톤이나 누군가의 열정적인 목소리 세트에서도 똑같이 할 수 있으므로, 단순히 나는 매우 열정적입니다.
비디오가 유용하기를 바랍니다.
댓글을 달고 생각을 공유하고, 알고리즘에 도움이 되는 비디오를 좋아해 주셔서 감사합니다.
다음 비디오에서 뵙기를 바랍니다.

SMALL

[ English Summary ]

We're going to show you how to use a cool, free AI tool to replicate your voice or someone else's voice.
Please use this tool responsibly.
You only need to provide a sample of the voice you want to replicate, and the tool will do the rest.
We'll walk you through the process step-by-step, including how to get the best training data.

Turtle Text-To-Speech is an open-source tool available for free on GitHub.
It can be run on your own computer.
Installation instructions are provided in the repository, but we'll be running the code in the cloud using Google Collaboration.
The author of the package initially removed Google Collaboration, but he made it available for other good people on the internet, so we'll use it.
However, for best results, we recommend installing it and running it locally.
Adding and cloning a new voice is now very simple.

First, you need to collect an audio clip of the speaker you want to clone.
In this case, I'll use my voice and show you a software that allows you to record voice segments.
The audio clips should be 10-second segments, and we require at least three segments, but the more data points you include, the better the quality.
The audio file you need to save should be saved as a web file, and it should have a 22 kilohertz sampling rate.
Most audio systems record at 44 kilohertz, so you will need to modify it to 22 kilohertz.
If you're running locally, you'll need to create a "speech" subdirectory and put your data in it.
If you're using a different cloud, the last three steps may be slightly different.
For audio recording, I'm using a free tool called interactivity.
I'll put the link in the description of this video.

This is a very powerful audio processing tool, but for now, we're only going to use the basic features.
To record audio, we first need to select our microphone.
Here you need to click on "Where" and then select the microphone you want to use.
I'm going to select this microphone.
Set it to 22 kilohertz.
Once you set the sampling rate to 22 kilohertz to record an audio segment, all you have to do is click the record button and it will start recording.
In this case, you are recording.
You can see how many seconds we've recorded, and we're up to about 10 seconds.
And then all you have to do is stop recording.
And then under File, click on Export, and we want to export it as a web file.
Save the directory as "audio" and go to the underscore style segments, and you'll see that we need about five segments, but in my case, I went to about 10 segments and you can hear what I recorded.

Now all you have to do is click the play button and your voice will be recorded.
In this case, I have recorded 10 different voice segments.
I'm recording because I want to make sure the sound quality is good.Now let's follow some guidelines.
Before you record an audio clip, be aware that the quality of the duplicate clip will vary depending on your input.
You'll want to avoid clips that contain background music noise or echoes.
You should also avoid clips of phone calls or clips with words like "oh" in them.
These are similar to the words that were removed from the sampling dataset.
Removing these elements will ensure that your results resemble the voice you want.
Also, if you want to hear a voice reading an audiobook, the content of the clip doesn't matter.
You need to provide a variety of text for the model to learn from.

Your notebook is ready to run.
We recommend that you copy and save this notebook.
You'll need to install a few libraries, so first select the runtime as GPU, then click the Connect button.
Run the first cell with the code you want to run.
When this cell runs, it will install the necessary libraries.
Run the next cell, which allows you to select the file you recorded.
You can select any file you've uploaded.
Next, you need to define the text that the clone sound should say.
You can use the preset option to set this up quickly.
The Default option is fast, but Standard takes more time for higher quality.
Run the last cell to generate the audio.
When you see the result, please like and comment.
Your clone won't sound exactly the same as mine, but there will be similarities.
You can use these optional features to get to your goal quickly.

Please check and use these tools responsibly before putting them into practice.Run this block again and then run the final cell.I ran it with two different settings, the first is Fast and the second is Super Fast.Let's see how this affects the quality of the output sound.
I only provided about 10 samples, so there may be some quality issues.
Also, the audio that I recorded is not really good quality audio.
The audio I provided is just a quick sample.
Now you can see if it's a good quality sample.The best feature of this library is that you can actually control the tone of the voice.So if you want a gloomy and wise voice, you can also have a happier voice or someone saying the same thing in a passionate voice.

[ English Full Text ]

In this video, I'll show you how to use this cool, free AI tool to replicate your voice or someone else's voice.
Keep in mind to use this tool responsibly.
All you need to do is provide a sample of the voice you want to replicate and the tool will do the rest.
We'll walk you through the step-by-step process, including how to get the best training data possible.
Be sure to watch the video to the end to see all the tips and tricks.
Let's start with this videoTurtle text-to-speech is an open-source tool, available for free on GitHub.
You can run it on your own computer.
Installation instructions are provided in the repository, but we're going to use Google Collaboration to run the code in the cloud.
The author of the package originally removed Google Collaboration, but made it available to other good people on the internet, so we will use it, but if you want the best results, I recommend installing and running it locally.The instructions are now very simple to follow, how to add a new voice and how to clone it, so here are all the instructions you need, first you need to collect an audio clip of the speaker you want to clone the voice from.
In my case, I'm going to use my own voice and show you some software that allows you to record voice segments.
The audio clip should then be a 10-second segment, a minimum of 3 segments is required, but it is always better to include more data points if you want better quality.The audio file that you need to save as a web file must be at a 22 kilohertz sampling rate.
Most audio systems record 44 kilohertz, but you'll need to modify it to 22 kilohertz.If you're running locally, you'll need to create a subdirectory called "speech" and put your data there.
If you're using a different cloud, the last three steps will now look a little different.
This is a departure from how we need to collect some data, so for audio recording, I'm using a free tool called interactivity.
I'll put a link to it in the video description.
It's a very powerful audio processing tool, but we're only going to use the basic features for now.
In order to record audio, you first need to select a microphone, so you can come over here and click on "Where" and then select the microphone you want to use.
I have this tone microphone, so if you remember the audio from the instructions, I'm going to select this.
Set it to 22 kilohertz.To record an audio segment, I'm going to set the sampling rate to 22 kilohertz, and all I have to do is click the record button and it will start recording my voice.
In this case, it's recording, and you can see how many seconds you've recorded, so let's say it's up to about 10 seconds.
And all I have to do is click stop recording, and then I'm going to go to file to export, and I want to export it as a web file, so here it is.
Save the directory as "audio," and if you go to the segments underline one, you'll see that you need about five segments, but you can also go to about 10 segments in my case and listen to what I recorded.
Now all you have to do is click the play button and it will start recording your voice, so in this case I recorded about 10 different voice segments because I want to listen to them because I'm recording and I want to make sure the audio quality is good.
Now, let's go over some guidelines to follow.
Before you record an audio clip, remember that the quality of your clone clip will depend on your input.
We've provided guidelines to follow, so for example, avoid clips with background music noise or reverb.
We've also said that we want to avoid clips from phone calls, and we want to avoid clips with excessive stuttering or words like "oh", similar to the spoken word speech that was removed from the training dataset.
When we provide data like the one on the right as input, we don't want to avoid these things, so we want to keep in mind what the output will sound like.
For example, if we want to hear a target voice reading an audio book, it doesn't matter what is being said in the clip.
We want a variety of text so that the model can learn.
Here's the notebook we're going to run in.
I suggest you go and make a copy of it.
Since we're going to make a copy and we're going to run a copy, we don't want to share any data with the author of the notebook, so the first step is to go to the runtime and make sure the runtime is GPU, there are other options, but choose GPU, save it, and then click connect.
It's going to be a very simple process.
We're just going to run a couple of cells, we don't need to know what code is running, so once we're connected, we can actually see the ram, GPU, and disk usage of this system, so come over here and click on that first.The display button in the first cell will run the code and install some libraries that we need to run the model.
Once the cell is done running, we'll wait for it to finish and you'll see this green indicator, so let's go down and run the second cell.
All I have to do is click the play button and this cell will download a big file so I can see what's going on.
Okay, that took about four minutes.
Let's run the next cell.
When we run this cell, it's actually going to ask us to select the file that we recorded when we run this cell.
We're going to upload the files that we recorded, all the files.
So all you have to do is select all the files, click open, and they'll start uploading one by one.
Okay, all the files will be uploaded in the next block.
I want to define the text that the clone sound should say, so I've plugged my channel, so if you find this video useful, please give it a like and don't forget to comment.
There's a preset option so you can set it up quickly, it's default, it's pretty quick, and then there's standard, which is high quality and it takes more time to process, so I'm going to run this itself and then I'm going to run the last cell, so I'm going to generate the audio here and let's see how it comes out so that the result is ready.
If you find this video useful, don't forget to give it a thumbs up and comment.
It doesn't sound exactly like me, but I think there's a resemblance.
And we can actually plan with presets, so we can go fast instead of high quality.
If you found this video useful, don't forget to like and comment.
Like!
Let's run this block again and then run the final cell.
So I ran it with two different settings.
The first one is Fast and the other one is Super Fast.
So let's see how that affects the quality of the output sound.
If you found this video useful, please like and leave a comment, and let's see it in super fast.
Please do me a favor.
If you find this video useful, don't forget to give it a thumbs up and comment.
I'm sure there are some quality issues and there may be some similarities, but I've only provided about 10 samples, and the audio I recorded is not really good quality audio because I recorded it in my room.
So the audio that I've provided is just a quick sample, and now you can make sure that it's a good quality sample.
The best feature of this library is that you can actually control the tone of the voice.
So if you want to have a really sad, wise set, like I need to put it in parentheses, you can do the same thing with a set that says the same thing but in a happier tone or someone's enthusiastic voice, so simply I'm very enthusiastic.
I hope you found the video useful.
Thank you for commenting, sharing your thoughts, and liking the video to help your algorithm.
Hope to see you in the next video.

LIST

'IT 인터넷 > 일반' 카테고리의 다른 글

GPT-4 칸 아카데미 심층 데모 :: ChatGPT 정리 (0)	2023.03.20
경제적 자유로 가기 위해서는 이 4가지만 기억하세요! (ft. 월가아재) :: ChatGPT 정리 (2)	2023.03.20
"저도 깜짝 놀랐습니다." 내 능력 100%를 다 사용하는 방법 (김익한 교수 2부) :: ChatGPT 정리 (1)	2023.03.19
성공 시그널 잡아내기 :: ChatGPT 정리 (0)	2023.03.19
처음 리더가 된 사람들을 위한 조언 by 전 메타(META) 수석팀장 :: ChatGPT 정리 (1)	2023.03.19

'IT 인터넷/일반' Related Articles

Comments