物書堂: NHK 日本語発音アクセント新辞典

NHK 日本語発音アクセント新辞典
link to the files in gdrive


16 Likes

Thanks for sharing. After so many years, finally we got the newest edition of the pronunciation dictionary! Many ppl will benefit from your hard work!

Wish you could also make the Korean dictionary and French dictionary(the yellow one especially, which IS the biggeset French-Japanese dictionary) by Monokakido


2 Likes

yes, i’ve mentioned previously that i’m going to convert all from monokakido

as for the korean dictionary, i recently parsed it, but it’s KR → JP

unfortunately i only have access to the dump from nyaa, which was shared in this topic:

Thanks for answer. As for Japanese and Korean has different writing system, what about make a merged version of Korean-Japanese/Japanese-Korean dictionary? Then it might be easier to use

it’s surely possible to have JP → KR, since the korean dictionaries from monokakido are also JP → KR, but it’s harder to convert when you have to duplicate the articles…
頑張ります!

i dont agree in publishing mdicts that arent faithful to the original dictionaries

but there are JP → KR dicts, so dont worry

小学館 韓日・日韓辞典


here’s an example in which 대장-장이 is the headword

대장-장이
<html xmlns="http://www.w3.org/1999/xhtml" lang="ja"><head><meta http-equiv="Content-Type" content="text/html;charset=utf-8"/><meta name="viewport" content="width = device-width, initial-scale = 1.0, user-scalable = yes, minimum-scale = 0.333, maximum-scale = 3.0"/><link rel="stylesheet" href="CHOUSENGO-KJ.css" media="all"/></head><body><div class="entry">


<div class="HeadlineG">
<span class="見出し">대장-장이</span>

[<span class="発音">tɛdʒaŋdʒaŋi</span>]
</div>
<div class="本文"><div class="語義G" id="23058-D001"><span class="品詞M">名</span> 鍛冶屋<span class="ruby">(かじや)</span>,鍛冶.</div>
<div class="ParaG"><span class="類義語M">類</span> 야장(冶匠).</div>
<div class="諺G"><span class="諺M">諺</span>
<div class="諺">대장장이[대장]의 집에 식칼이 놀다</div>
<div class="諺訳">鍛冶屋に包丁が足りない(当然あるべきだと思われている所にむしろない場合が多い).</div></div>
</div>
<div id="index"><a href="#23058-D001">[名]</a></div>
</div></body></html>
</>

i’m still working on it, so please be patient.
i’ll surely post the mdict when it’s done

Never mind, there are very few non-English/Japanese language dictionaries on the forum, so we who study them are always patient​:grin::grin:

1 Like

while I’m producing the monokakido dictionaries, I’m also coding a scrapper for the dongri dictionaries, so i think that our number of Japanese dictionaries will increase significantly , and more importantly: high quality dictionaries

4 Likes

link to the files in gdrive 总是打不开,请问能否将文件上传到本站或百度网盘,谢谢。

等了这么多年
终于用上新版了

不是我不用正版
是我实在不喜欢手机版。。。

谢谢分享~

我试了下这个新版
可能要求哪位老大再优化下

老版的,词条有224067
可以直接搜索假名

新版的,词条61246
不能搜索假名

比如:縁日 えんにち
用假名 えんにち,新版搜索不到
只能搜索汉字:縁日

刚又比对了下这位老大之前发的

这个刚好相反,只能搜索假名,不能搜索汉字。。。
这2个新版NHK,得搭配用?

seems that many entries are missing: ウイルス, おいしい…

thanks for pointing out.
just to make it clear: the entries aren’t missing , they just aren’t accessible , I’ll fix it

i’ve searched in the dump from here and there’s no results for おいしい

$ NHK_ACCENT/contents$ grep -R 'おいしい' .

so i cant do anything about it.
but i have fixed the other entries, and now all the available entries are accessible

Can you search in Japanese Kana?
For example: えんにち

I can only get results by searching for Kanji 縁日.
I’ve used a modified version of mdx

Yes, you’re correct. Both ウイルス and おいしい(美味しい) are indeed not in the source file.
But when you search these two entries on your iphone [NHK 日本語発音アクセント新辞典] app, clearly these two entries are there.
So, what went wrong here?
May I suggest that during your decryption of the *.rsc source files to plain text/html files, some entries were inadvertently lost?
I badly need this audio dictionary.
Anyway, thanks for your generous sharing of these mdx/mdd files.

On the profile page of NHK日本語発音アクセント新辞典 (NHK Japanese Pronunciation Dictionary), this dictionary boasts of having 75,000 headword counts and 102,800 audio counts.
When I unpacked the mdx file you’d uploaded on Google Drive to plain text files, only 61,246 headword counts and 98,905 audio counts were found in them.
As you can see, a considerable amount of entries were missing.
Could you help check them up, and see what went wrong?

2 Likes