Project Webpage
[code]
Reference
FocalCodec@50 (0.65 kbps) (Ours)
FocalCodec@25 (0.33 kbps) (Ours)
FocalCodec@12.5 (0.16 kbps) (Ours)
BigCodec (1.04 kbps)
DAC (1.00 kbps)
EnCodec (1.50 kbps)
Mimi (0.69 kbps)
SemantiCodec (0.65 kbps)
SpeechTokenizer (1.00 kbps)
Stable Codec (0.70 kbps)
WavLM6-KM (0.45 kbps)
WavTokenizer (0.48 kbps)
Input
Reference
FocalCodec@50 (0.65 kbps)
SpeechTokenizer (1.00 kbps)