데이터셋 생성기 만들어봄

AI 음성 채널

알림 알림 중 알림 취소

구독자 5459명 알림수신 123명 @The_Voice

TTS, VITS, SVC와 같은 딥러닝 음성 합성 기술 관련 정보와 이야기를 공유합니다.

💾자료 데이터셋 생성기 만들어봄

배개

추천 17 비추천 0 댓글 30 조회수 3208 작성일 2024-01-18 17:24:22

https://arca.live/b/aispeech/96954933

VITS학습 해보려고 했는데 자동으로 데이터셋 만드는 걸 찾아보는데 잘 안나오는 거 같더라고

그래서 여기저기 찾아서 짜집기로 함 만들어봄

주요 기능

- 유튜트에서 비디오 음성 다운로드 (로컬파일도 가능)

- 영상 wav로 변환

- wav 보컬 추출

- 대사 추출 후 대사에 따른 음성 파일 분리

- train.txt와 val.txt 자동 생성 (4:1 비율)

주의 사항

- ffmpeg가 컴퓨터에 설치되어있어야 실행할 때 에러 안남

- 이렇게 만들어도 이상한 보컬이나 콧노래같은건 직접 좀 쳐내야 학습이 잘됨

https://github.com/hopoduck/EZVitsDataset

사용방법 같은건 깃헙에 적어놨는데 사실 나도 파이썬 뉴비라 잘 모를 수 있음..

하랏세오

2024-01-18 20:05:26

초딩영웅

2024-01-19 03:51:28

이건 번외인데 혹시 갑자기 현재 챈에서 "이 콘텐츠는 해당 국가에서 이용할 수 없습니다." 라고 뜨는데 왜 그러는지 알까요? ㅠㅠ

펼쳐보기▼

시카고피자

2024-01-19 10:09:44

그거 국가가 대한민국으로 리셋되서 그럴거에요 vpn으로 나라 바꿔야함 ㅇㅇ

펼쳐보기▼

PG641633

2024-02-12 12:42:33

*수정됨

깃헙 메뉴얼에있는거 다 따라하고 실행하니 ERROR: Could not find a version that satisfies the requirement ezvitsdataset (from versions: none)
ERROR: No matching distribution found for ezvitsdataset 뭐가 문제일까요??

펼쳐보기▼

배개

2024-02-12 15:33:58

pip install 하던중에 에러난 거 같은데 마지막줄꺼 install하다 에러남?

펼쳐보기▼

쿠루가이

2024-02-13 00:49:17

고마워 잘동작하는거 확인했어.
그런데 vits에 데이터로 활용하려면 어떻게 해야되는거야?
미안한데 간단하게 순서를 알려줄수 있어?

펼쳐보기▼

배개

2024-02-13 01:39:14

순서는 그걸로 데이터 넣고 전처리하고 학습하면 되는데 어떤 레포 기준인데?

펼쳐보기▼

쿠루가이

2024-02-13 04:35:37

https://github.com/Roista57/vits-webui.git

여기있는걸로 webui설치될때 vits도 같이 설치되서 이걸로 했어. 
여기서 vits 명령어로 전처리까지는 OK인데 막상 학습하려고하면 에러가 나서.
자세한 순서는 내가 집에가서 정리해서 올릴려고하는데 그때 좀 봐주랑!

GitHub

GitHub - Roista57/vits-webui

Contribute to Roista57/vits-webui development by creating an account on GitHub.

여기있는걸로 webui설치될때 vits도 같이 설치되서 이걸로 했어. 
여기서 vits 명령어로 전처리까지는 OK인데 막상 학습하려고하면 에러가 나서.
자세한 순서는 내가 집에가서 정리해서 올릴려고하는데 그때 좀 봐주랑!

펼쳐보기▼

배개

2024-02-13 05:02:58

*수정됨

일단 내용보면 오디오 파일 다 넣고 스텝 2부터 하면 되는거 같은데 안되는부분 있으면 말해줘봐

펼쳐보기▼

쿠루가이

2024-02-13 09:58:11

1. 유튜브에 있는 영상으로 데이터셋 만들기 : OK
cd D:\AI\vits-webui\EZVitsDataset
d:
conda activate dataset
del D:\AI\vits-webui\EZVitsDataset\output\*.*
del D:\AI\vits-webui\EZVitsDataset\download\*.*
del D:\AI\vits-webui\EZVitsDataset\filelist\*.*
main.py 내용 수정 : 다운받고 싶은 유튜브 주소나 로컬 주소
python main.py


결과
D:\AI\vits-webui\EZVitsDataset\filelist\
 + filelist.txt
 + train.txt
 + val.txt
 + config.json

D:\AI\vits-webui\EZVitsDataset\output\
 + U7wRzyG4ke0.0001.wav
 + U7wRzyG4ke0.0002.wav
 + ....(생략).....



2. 전처리 : OK

2-1. 요건 사전 설치
conda create --name vits python=3.8
conda activate vits
pip3 install torch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 --index-url https://download.pytorch.org/whl/cu117
pip install -r requirements.txt

cd monotonic_align
mkdir monotonic_align
python setup.py build_ext --inplace
cd ..

pip install numpy==1.22   # 이버전 설치하니까 됨.



(D:\AI\vits-webui\venv) D:\AI\vits>python preprocess.py --text_index 1 --filelists EZVitsDataset/filelist/train.txt EZVitsDataset/filelist/val.txt --text_cleaners "korean_cleaners"
START: EZVitsDataset/filelist/train.txt
100%|████████████████████████████████████████████████████████████████████████████████| 91/91 [00:00<00:00, 3033.12it/s]
START: EZVitsDataset/filelist/val.txt
100%|████████████████████████████████████████████████████████████████████████████████| 22/22 [00:00<00:00, 3668.10it/s]

결과
D:\AI\vits-webui\EZVitsDataset\filelist\
 + filelist.txt
 + train.txt
 + val.txt
 + train_cleaned.txt
 + val_cleaned.txt

3. config.json 내용 기입
{
  "train": {
    "log_interval": 100,
    "eval_interval": 100,
    "seed": 1234,
    "epochs": 100,
    "learning_rate": 0.0002,
    "betas": [
      0.8,
      0.99
    ],
    "eps": 1e-09,
    "batch_size": 16,
    "fp16_run": true,
    "lr_decay": 0.999875,
    "segment_size": 8192,
    "init_lr_ratio": 1,
    "warmup_epochs": 0,
    "c_mel": 45,
    "c_kl": 1.0
  },
  "data": {
    "training_files": "EZVitsDataset\\filelist\\train_cleaned.txt",
    "validation_files": "EZVitsDataset\\filelist\\val_cleaned.txt",
    "text_cleaners": [
      "korean_cleaners"
    ],
    "max_wav_value": 32768.0,
    "sampling_rate": 22050,
    "filter_length": 1024,
    "hop_length": 256,
    "win_length": 1024,
    "n_mel_channels": 80,
    "mel_fmin": 0.0,
    "mel_fmax": null,
    "add_blank": true,
    "n_speakers": 0,
    "cleaned_text": true
  },
  "model": {
    "inter_channels": 192,
    "hidden_channels": 192,
    "filter_channels": 768,
    "n_heads": 2,
    "n_layers": 6,
    "kernel_size": 3,
    "p_dropout": 0.1,
    "resblock": "1",
    "resblock_kernel_sizes": [
      3,
      7,
      11
    ],
    "resblock_dilation_sizes": [
      [
        1,
        3,
        5
      ],
      [
        1,
        3,
        5
      ],
      [
        1,
        3,
        5
      ]
    ],
    "upsample_rates": [
      8,
      8,
      2,
      2
    ],
    "upsample_initial_channel": 512,
    "upsample_kernel_sizes": [
      16,
      16,
      4,
      4
    ],
    "n_layers_q": 3,
    "use_spectral_norm": false,
    "gin_channels": 256
  },
  "speakers": [
    "0"
  ],
  "symbols": [
    "_",
    ",",
    ".",
    "!",
    "?",
    "\u2026",
    "~",
    "\u3131",
    "\u3134",
    "\u3137",
    "\u3139",
    "\u3141",
    "\u3142",
    "\u3145",
    "\u3147",
    "\u3148",
    "\u314a",
    "\u314b",
    "\u314c",
    "\u314d",
    "\u314e",
    "\u3132",
    "\u3138",
    "\u3143",
    "\u3146",
    "\u3149",
    "\u314f",
    "\u3153",
    "\u3157",
    "\u315c",
    "\u3161",
    "\u3163",
    "\u3150",
    "\u3154",
    " "
  ]
}


4. 학습
python train.py -c EZVitsDataset\filelist\config.json -m EZVitsDataset\filelist

(vits) D:\AI\vits>python train.py -c EZVitsDataset\filelist\config.json -m EZVitsDataset\filelist
INFO:filelist:{'train': {'log_interval': 100, 'eval_interval': 100, 'seed': 1234, 'epochs': 100, 'learning_rate': 0.0002, 'betas': [0.8, 0.99], 'eps': 1e-09, 'batch_size': 16, 'fp16_run': True, 'lr_decay': 0.999875, 'segment_size': 8192, 'init_lr_ratio': 1, 'warmup_epochs': 0, 'c_mel': 45, 'c_kl': 1.0}, 'data': {'training_files': 'EZVitsDataset\\filelist\\train_cleaned.txt', 'validation_files': 'EZVitsDataset\\filelist\\val_cleaned.txt', 'text_cleaners': ['korean_cleaners'], 'max_wav_value': 32768.0, 'sampling_rate': 22050, 'filter_length': 1024, 'hop_length': 256, 'win_length': 1024, 'n_mel_channels': 80, 'mel_fmin': 0.0, 'mel_fmax': None, 'add_blank': True, 'n_speakers': 0, 'cleaned_text': True}, 'model': {'inter_channels': 192, 'hidden_channels': 192, 'filter_channels': 768, 'n_heads': 2, 'n_layers': 6, 'kernel_size': 3, 'p_dropout': 0.1, 'resblock': '1', 'resblock_kernel_sizes': [3, 7, 11], 'resblock_dilation_sizes': [[1, 3, 5], [1, 3, 5], [1, 3, 5]], 'upsample_rates': [8, 8, 2, 2], 'upsample_initial_channel': 512, 'upsample_kernel_sizes': [16, 16, 4, 4], 'n_layers_q': 3, 'use_spectral_norm': False, 'gin_channels': 256}, 'speakers': ['0'], 'symbols': ['_', ',', '.', '!', '?', '…', '~', 'ㄱ', 'ㄴ', 'ㄷ', 'ㄹ', 'ㅁ', 'ㅂ', 'ㅅ', 'ㅇ', 'ㅈ', 'ㅊ', 'ㅋ', 'ㅌ', 'ㅍ', 'ㅎ', 'ㄲ', 'ㄸ', 'ㅃ', 'ㅆ', 'ㅉ', 'ㅏ', 'ㅓ', 'ㅗ', 'ㅜ', 'ㅡ', 'ㅣ', 'ㅐ', 'ㅔ', ' '], 'model_dir': 'checkpoints\\EZVitsDataset\\filelist'}
INFO:torch.distributed.distributed_c10d:Added key: store_based_barrier_key:1 to store for rank: 0
INFO:torch.distributed.distributed_c10d:Rank 0: Completed store-based barrier for key:store_based_barrier_key:1 with 1 nodes.
  0%|                                                                                            | 0/6 [00:27<?, ?it/s]
Traceback (most recent call last):
  File "train.py", line 302, in <module>
    main()
  File "train.py", line 56, in main
    mp.spawn(run, nprocs=n_gpus, args=(n_gpus, hps,))
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 240, in spawn
    return start_processes(fn, args, nprocs, join, daemon, start_method='spawn')
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 198, in start_processes
    while not context.join():
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 160, in join
    raise ProcessRaisedException(msg, error_index, failed_process.pid)
torch.multiprocessing.spawn.ProcessRaisedException:

-- Process 0 terminated with the following error:
Traceback (most recent call last):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 69, in _wrap
    fn(i, *args)
  File "D:\AI\vits\train.py", line 123, in run
    train_and_evaluate(rank, epoch, hps, [net_g, net_d], [optim_g, optim_d], [scheduler_g, scheduler_d], scaler, [train_loader, eval_loader], logger, [writer, writer_eval])
  File "D:\AI\vits\train.py", line 143, in train_and_evaluate
    for batch_idx, (x, x_lengths, spec, spec_lengths, y, y_lengths) in enumerate(tqdm(train_loader)):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\tqdm\std.py", line 1181, in __iter__
    for obj in iterable:
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 628, in __next__
    data = self._next_data()
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1333, in _next_data
    return self._process_data(data)
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1359, in _process_data
    data.reraise()
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\_utils.py", line 543, in reraise
    raise exception
IndexError: Caught IndexError in DataLoader worker process 0.
Original Traceback (most recent call last):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\_utils\worker.py", line 302, in _worker_loop
    data = fetcher.fetch(index)
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\_utils\fetch.py", line 58, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File

펼쳐보기▼

배개

2024-02-13 09:59:28

sampling rate한번 확인해봐 내쪽에서 데이터셋 만들때 기본이 44100hz임

펼쳐보기▼

쿠루가이

2024-02-13 10:03:23

미안한데 config.json 샘플 내용을 제공해줄수 있을까???ㅠㅠ

펼쳐보기▼

쿠루가이

2024-02-13 10:03:46

일단 말한대로 샘플레이트 변경하니까 다음으로 넘어갔는데 다른 문제가 나와서.^^;;

펼쳐보기▼

배개

2024-02-13 10:05:10

지금 밖이라 들어가면 확인해보고 올려줄게... 오류는 뭐라 뜨는데?

펼쳐보기▼

쿠루가이

2024-02-13 10:05:39

아래와 같이 뜨고 있오!

Traceback (most recent call last):
  File "train.py", line 302, in <module>
    main()
  File "train.py", line 56, in main
    mp.spawn(run, nprocs=n_gpus, args=(n_gpus, hps,))
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 240, in spawn
    return start_processes(fn, args, nprocs, join, daemon, start_method='spawn')
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 198, in start_processes
    while not context.join():
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 160, in join
    raise ProcessRaisedException(msg, error_index, failed_process.pid)
torch.multiprocessing.spawn.ProcessRaisedException:

-- Process 0 terminated with the following error:
Traceback (most recent call last):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\multiprocessing\spawn.py", line 69, in _wrap
    fn(i, *args)
  File "D:\AI\vits\train.py", line 123, in run
    train_and_evaluate(rank, epoch, hps, [net_g, net_d], [optim_g, optim_d], [scheduler_g, scheduler_d], scaler, [train_loader, eval_loader], logger, [writer, writer_eval])
  File "D:\AI\vits\train.py", line 143, in train_and_evaluate
    for batch_idx, (x, x_lengths, spec, spec_lengths, y, y_lengths) in enumerate(tqdm(train_loader)):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\tqdm\std.py", line 1181, in __iter__
    for obj in iterable:
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 628, in __next__
    data = self._next_data()
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1333, in _next_data
    return self._process_data(data)
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\dataloader.py", line 1359, in _process_data
    data.reraise()
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\_utils.py", line 543, in reraise
    raise exception
NotImplementedError: Caught NotImplementedError in DataLoader worker process 0.
Original Traceback (most recent call last):
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\_utils\worker.py", line 302, in _worker_loop
    data = fetcher.fetch(index)
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\_utils\fetch.py", line 58, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "C:\Users\hwyoo\miniconda3\envs\vits\lib\site-packages\torch\utils\data\_utils\fetch.py", line 58, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
  File "D:\AI\vits\data_utils.py", line 94, in __getitem__
    return self.get_audio_text_pair(self.audiopaths_and_text[index])
  File "D:\AI\vits\data_utils.py", line 62, in get_audio_text_pair
    spec, wav = self.get_audio(audiopath)
  File "D:\AI\vits\data_utils.py", line 76, in get_audio
    spec = spectrogram_torch(audio_norm, self.filter_length,
  File "D:\AI\vits\mel_processing.py", line 63, in spectrogram_torch
    y = torch.nn.functional.pad(y.unsqueeze(1), (int((n_fft-hop_size)/2), int((n_fft-hop_size)/2)), mode='reflect')
NotImplementedError: Only 2D, 3D, 4D, 5D padding with non-constant padding are supported for now

펼쳐보기▼

배개

2024-02-13 10:16:19

중간에 speaks: 하고 배열로 있는거 지워볼래...? 아마 오디오파일 스테레오/모노 관련 문제였던 거 같은데 확실히 모르겠네

펼쳐보기▼

배개

2024-02-13 10:18:02

{
  "train": {
    "log_interval": 200,
    "eval_interval": 500,
    "seed": 1234,
    "epochs": 20000,
    "learning_rate": 2e-4,
    "betas": [0.8, 0.99],
    "eps": 1e-9,
    "batch_size": 8,
    "fp16_run": false,
    "lr_decay": 0.999875,
    "segment_size": 8192,
    "init_lr_ratio": 1,
    "warmup_epochs": 0,
    "c_mel": 45,
    "c_kl": 1.0,
    "fft_sizes": [384, 683, 171],
    "hop_sizes": [30, 60, 10],
    "win_lengths": [150, 300, 60],
    "window": "hann_window"
  },
  "data": {
    "training_files": "filelists/train.txt.cleaned",
    "validation_files": "filelists/val.txt.cleaned",
    "text_cleaners": ["jke_cleaners"],
    "max_wav_value": 32768.0,
    "sampling_rate": 44100,
    "filter_length": 1024,
    "hop_length": 256,
    "win_length": 1024,
    "n_mel_channels": 80,
    "mel_fmin": 0.0,
    "mel_fmax": null,
    "add_blank": true,
    "n_speakers": 0,
    "cleaned_text": true
  },
  "model": {
    "ms_istft_vits": true,
    "mb_istft_vits": false,
    "istft_vits": false,
    "subbands": 4,
    "gen_istft_n_fft": 16,
    "gen_istft_hop_size": 4,
    "inter_channels": 192,
    "hidden_channels": 192,
    "filter_channels": 768,
    "n_heads": 2,
    "n_layers": 6,
    "kernel_size": 3,
    "p_dropout": 0.1,
    "resblock": "1",
    "resblock_kernel_sizes": [3, 7, 11],
    "resblock_dilation_sizes": [
      [1, 3, 5],
      [1, 3, 5],
      [1, 3, 5]
    ],
    "upsample_rates": [4, 4],
    "upsample_initial_channel": 512,
    "upsample_kernel_sizes": [16, 16],
    "n_layers_q": 3,
    "use_spectral_norm": false,
    "use_sdp": false
  }
}


내꺼 학습할 때 했던 config

펼쳐보기▼

쿠루가이

2024-02-13 10:38:51

음...너껄로 바꿔서 해봐도 동일한 에러가 표시되고 있어...잠깐 구글링해야봐야겠다.ㅠㅠ

펼쳐보기▼

쿠루가이

2024-02-13 10:44:18

데이터셋에 스트레오가 있어서 문제가 됐다고 하는데
https://arca.live/b/aispeech/92090166?p=1

혹시 EZVitsDataset에서 모노로 설정하는 방법을 알수 있을까?

펼쳐보기▼

배개

2024-02-13 10:49:38

ㅇㅇ 안그래도 그거인 거 같은데.. 원래 다 모노로 변환하게 해 둔건데 안바뀌는건가 싶네
내부 소스 203줄에 ac=1되어있는게 그건데

펼쳐보기▼

쿠루가이

2024-02-13 10:53:06

그러게 소스에는 ac=1로 되어 있는데 실제로 오다시티로 봤을때는 스트레오로 되어 있네..

펼쳐보기▼

쿠루가이

2024-02-13 11:04:57

형...일단 ffmpeg로 모두 모노로 바꿨는데 잘됨. 스트레오 변환 부분이 맞는거같애.

INFO:filelist:Saving model and optimizer state at iteration 1 to checkpoints\EZVitsDataset\filelist\D_0.pth
100%|████████████████████████████████████████████████████████████████████████████████████| 9/9 [01:09<00:00,  7.67s/it]
INFO:filelist:====> Epoch: 1
100%|████████████████████████████████████████████████████████████████████████████████████| 9/9 [00:34<00:00,  3.89s/it]

미안한데 시간날때 검토를 부탁해도 될까?

펼쳐보기▼

배개

2024-02-13 11:20:44

ㅇㅇ 그 문제 맞는거같은데 확인해보고 다시 답글 달아줄게 오류보고 고마워

펼쳐보기▼

쿠루가이

2024-02-13 11:30:22

배개

2024-02-13 11:51:43

*수정됨

감사 덕분에 수정함ㅋㅋㅋ 버그 수정해서 다시 올려놨고, 원인은 확인해보니까 비디오>오디오 변환할 때 모노로 변경은 했는데, 음성 분리하는쪽에서 다시 스테레오로 만들어버리는거였네..

펼쳐보기▼

쿠루가이

2024-02-13 11:58:40

고마워 형! 잘쓸겡!^^

펼쳐보기▼

읕

2024-02-20 16:37:55

ERROR:audio_separator.separator.separator:Failed to download file from https://github.com/TRvlvr/model_repo/releases/download/all_public_uvr_models/UVR_MDXNET_KARA_2

펼쳐보기▼

읕

2024-02-20 16:39:29

해당 부분에서 에러가 나네용.. UVR_MDXNET_KARA_2 모델이 필요한건가 싶어서 https://huggingface.co/seanghay/uvr_models/blob/main/UVR_MDXNET_KARA_2.onnx 여기서 다운 받아서 audio_model 폴더에 넣어서 진행해봤는데도 안됩니당..

huggingface.co

UVR_MDXNET_KARA_2.onnx · seanghay/uvr_models at main

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

해당 부분에서 에러가 나네용.. UVR_MDXNET_KARA_2 모델이 필요한건가 싶어서 https://huggingface.co/seanghay/uvr_models/blob/main/UVR_MDXNET_KARA_2.onnx 여기서 다운 받아서 audio_model 폴더에 넣어서 진행해봤는데도 안됩니당..

펼쳐보기▼

배개

2024-02-20 16:43:29

고쳐야할 거 같은데 일단 임시로 하려면 프로젝트 루트 경로에 audio_model이라는 폴더 만들고 그 모델 넣고 해봐 아마 사용중인 라이브러리에서 모델 받다가 주소가 바뀌어서 그런거라 내가 수정하긴 해야 할 듯

펼쳐보기▼

배개

2024-02-20 16:47:51

*수정됨

아마 별 차이 없을거같긴 한데 원래 주소는 https://github.com/TRvlvr/model_repo/releases/download/all_public_uvr_models/UVR_MDXNET_KARA_2.onnx 여기서 받으면 됨

펼쳐보기▼

본 게시물에 댓글을 작성하실 권한이 없습니다. 로그인 하신 후 댓글을 다실 수 있습니다. 아카라이브 로그인

전체글 개념글

최근 최근 방문 채널

최근 방문 채널

전체 일반 📄정보 💾자료 ❓질문 ❗공지 🔨운영

번호 제목

작성자 작성일 조회수 추천

공지 아카라이브 모바일 앱 이용 안내(iOS/Android)

*ㅎㅎ 2020.08.18 28135445

공지 ★필독★ AI 음성 채널 기본 통합 공지 (23-06-12)

ㅇㅇ 2023.03.06 24654

공지 ★필독★ 음성모델 공유 관련 규정 (23-06-14)

The_Voice 2023.06.13 14811

공지 AI 음성챈을 처음 방문한 히치하이커를 위한 안내서 (23-07-01)

Tacotron2 2023.06.07 43294

공지 채널 내에서 "AI 성우" 라는 용어 사용을 자제해주길 바람.

공지 국내 가수 및 스트리머, 성우를 활용한 창작물은 업로드 금지임

무명의개념 2023.07.04 4075

숨겨진 공지 펼치기(3개)

160 일반 3. 초보자를 위한 Pre-Trained Model의 설명과 이해 [7]

DeepWeb 2024.05.08 104 9

159 일반 2. 초보자를 위한 모델 붕괴 & 일반화 실패 이야기 [3]

DeepWeb 2024.05.07 141 12

158 일반 1. 초보를 위한 TensorBoard 그래프를 보는방법~! [8]

DeepWeb 2024.05.04 446 27

157 📄정보 RVC 사전학습모델 KLM, 기본(f0) 비교 [7]

성유진 2024.04.28 721 8

156 📄정보 RVC 사전학습모델 비교 [7]

piru 2024.04.27 719 9

155 일반 추가1,해결됨)RVC 한국어 사전학습모델 applio에서만 돌아감 [3]

PPAP 2024.04.23 881 10

154 📄정보 데이터셋 비교 [6]

piru 2024.04.22 699 9

153 일반 RVC 비공식 사전학습모델 모음집 [4]

PPAP 2024.04.18 1394 12

152 📄정보 신디사이저V로 RVC같은 학습 돌리는거 라이센스 위반임+잡정보 [26]

야이야이아 2024.04.16 825 8

151 📄정보 속보)보컬 분리 모델 혁명일어남 [33]

벱나난비 2024.04.07 2315 18

150 📄정보 입문이 어려운 초보자를 위한 TTS 학습 시작 부터 원리 설명– Bert-VITS2(1편) [26]

선무공신 2024.03.08 2976 7

149 📄정보 (나빼고 다아는)UVR 화음분리 팁 [8]

벱나난비 2024.03.08 1876 12

148 일반 2024.02.25 코랩 환경 업뎃 후 일부 코랩 오류 [9]

PPAP 2024.02.25 1719 8

147 일반 RVC crepe 코랩 문제 해결했습니다. [6]

Xwlcn 2024.02.21 1174 11

146 일반 같이 재밌게 AI 음성 연구 해보실분 있나요? [5]

son 2024.02.16 1307 8

145 💾자료 데이터셋 생성기 만들어봄 [30]

배개 2024.01.18 3209 17

144 💾자료 Bert-VITS2 한국어 버전 코드 공개 [24]

사과는맛있어맛있으면바나나 2024.01.16 3517 19

143 일반 가급적이면 AI 음성으로 명성 얻을 생각 안 했으면 좋겠음 [13]

The_Voice 2024.01.15 3046 24

142 📄정보 사전 훈련 모델 Ov2 뭐시기 기존 RVC 사용자들 적용방법. [7]

farthestfrontier 2024.01.09 2291 7

전체글 개념글

사용하고 계신 브라우저가 시간대 설정을 지원하지 않으므로 GMT 시간대가 적용됩니다.