Forvo Japanese

Forvo Japanese
Many thanks to @tovaremeterio

  1. 本mp3发音辞典,在GoldenDict测试没问题

  2. 本mp3发音辞典无法使用的人,可能需要其他大神来帮忙了
    Forvo Japanese1
    Forvo Japanese2
    Forvo Japanese3
    Forvo Japanese4
    Forvo Japanese5
    Forvo Japanese6

9 Likes

谢谢啦。声音清晰。

3 Likes

是否能转成opus音频格式,好像体积会小很多?

2 Likes
  1. MP3的兼容性比较好,比较少问题会发生

  2. 有需要opus音频格式可自行动手,因为mdx是现成的,解包后,用EmEditor等把.mp3取代为.opus,再打包回mdx即可使用,接着再去下载opus音频格式檔再打包为mdd即可
    FORVO Pronunciations in All Languages (>6.7 million)

2 Likes

好的,谢谢指点,这个ja的文件就是日语的opus音频文件吗?

1 Like

字头是哪里来的呢?

@hahaya As explained in my post:

Languages are organized according to their code on Forvo.

Many thanks to @supplement ! Great job ! :smiley:

3 Likes
  1. 对,就是下载opus的ja文件

  2. 将ja文件解包到Forvo Japanese目录内

  3. 此时状态为:Forvo Japanese\ja

  4. 设好下图的三个参数后,点Start,即可得到Forvo Japanese.mdd

2 Likes

嗯嗯,明白,谢谢!

1 Like

It would be great if you make another post with detailed instructions about how to compile Forvo .mdx/.mdd dictionaries for any language.

Many people would be interested in making their own .mdx/mdd Forvo for English, French, Spanish, etc.

Someone on PDAWiki sells for 20 US dollars Forvo .mdx/.mdd for each language :rage:

We should make knowledge free and provide others the tools to make their own dictionaries:smiley:

1 Like

谢谢分享

但发现一个问题
发音的顺序并不是按照网页上显示的排序

比如 さようなら

得票最多的,网页排在第1个的
词典中是排在第4个

得票第2的,网页排在第2个的
词典中是排在第10个

如果能把 发音者、投票数 也标注起
并按照投票数排序就好了

投票数 可能有变动的话,可以在线取数据
谢谢

2 Likes

这已经超出我的能力范围,所以请原谅我无法帮忙

如果您有需求,请自己动手试看看,不然就等其他大神来帮忙

而且我觉得可以正常发音使用,已经很满足、很知足了,所以就不折腾了

1 Like

流程介绍

打包mdd

打包mdx的流程

Forvo Japanese.rar (13.3 MB)
檔案解包后是一些半成品文件

name.txt、path name.txt

  1. 使用Total Commander到Forvo Japanese目录内

  2. 命令→分支视图(按「Ctrl + B」)

  3. 标记→反向选择(按 数字键盘「*」)

  4. 标记→复制名称

  5. 复制到EmEditor,取代删除后面的.opus→得到name.txt

  6. 标记→复制完整路径及名称

  7. 复制到EmEditor,取代删除前面路径,整理成「ja/directory/file.opus」→得到path name.txt

Forvo Japanese.xlsx

  1. 打开Forvo Japanese.xlsx
  2. 把前面整理好的name.txt、path name.txt的资料,复制到相关位置
  3. 整理完后,另存檔案为xxx.txt

Forvo Japanese.txt

  1. 用EmEditor整理xxx.txt(参考文件如Forvo Japanese.txt)
  2. 整理完后,确认编码是「UTF-8 不带签名」,如果不是,另存新檔就可改变编码
  3. 打包成mdx(如果有问题,用EmEditor再整理一下词头,按ctrl+G输入位置,详细请参考如下)

Mdx打包时遇到以下问题是什么原因

2 Likes

It is possible to show on the .mdx the speaker’s votes, country, and sex.

The German Forvo .mdx (Full Version) shows that info:

Download: https://forum.freemdict.com/t/topic/7355

The data about votes, speaker, sex and country is available in the metadata of Forvo Audios:

Check the folder called “Z (Tools related to Electronic Dictionaries - Scripts & Software)” > Forvo Scraping Tools.

However, it is a difficult job to use the metadata. I cannot do it by myself. IMHO, it is acceptable just to compile a “simple” .mdx that shows all the pronunciations.

The German Forvo was compiled by Xu Yinuo. He wrote a custom python script to process the metadata but the code is unreadable even for him after few months… It is messy to deal with metadata. Anyway, if you want you could use it…

1 Like

链接似乎出现了点问题,mdd无法下载。下载的时候直接显示链接失效

谢谢~~~~~

1 Like

检查中。 @雨落風間 应该好了。

3 Likes

反馈一个问题,我安装在欧陆上,假设一个词有多个发音,不论点击哪一个都是只有同一个发音,其他发音都无法听到,不知道可不可以修正,谢谢

已下载成功。感谢站长先生和制作辞典的前辈,很实用的辞典。

2 Likes

很抱歉,我没有在使用欧陆,所以无能为力
欧陆需要其他大神来帮忙了