I have aquired the original data for 旺文社国語辞典第十一版 through the official apk. It looks like this data can be converted to clear text with ebdump. Also included is a list of gaiji with their unicode equivalent, all written by hand. (thank you very much shoui520)
I’m not experienced enough to convert it myself so I’m hoping a more experienced user could convert this to a beautifully formatted mdict dictionary.
Thank you very much to shoui520 and irhello from learnjapanese.moe for providing me with the original data.
P.S.: I’m not sure which category/tag to give this post because I can’t read chinese.
Machine translation:
我通过官方apk获取了旺文社国语辞典第十一版的原始数据。看起来这些数据可以用ebdump转换为清晰的文本。此外,还包括了一份与unicode对应的外文字典清单,都是手写的。(非常感谢shoui520)
我没有足够的经验来自己转换,所以我希望更有经验的用户能将其转换为格式优美的mdict词典。
非常感谢来自learnjapanese.moe的shoui520和irhello为我提供的原始数据。
P.S.: 我不知道该给这个帖子归入哪个类别/标签,因为我不懂中文。
UPDATE
2022/10/31
I have done a complete text dump of the logovista dictionary using ebdump. I have also extracted all the images with their original file names. Furthermore, I have generated a gaijimap file that converts gaiji codes to unicode codes using a built in tool in EBWin.
The dumped file contains a bunch of codes that actually represent html elements like this:
[1120:38E]<1F09><0010>【絶<1F0E>△<1F0F>巓】<1F0A>
In this case you can see that <1F0E>
and <1F0F>
most likely correspond to <sup>
and </sup>
. Each element needs to be rebuilt using regex replacements using the android app as a reference point.
I was able to bypass the android app protection using an android emulator and lucky patcher.
我已经使用ebdump完成了logovista字典的完整文本转储。我还提取了所有带有原始文件名的图像。此外,我还生成了一个gaijimap文件,使用EBWin中的内置工具将gaiji代码转换为unicode代码。
转储文件包含一堆代码,这些代码实际上表示如下html元素:
[1120:38E]<1F09><0010>【絶<1F0E>△<1F0F>巓】<1F0A>
在这种情况下,您可以看到<1F0E>
和<1F0F>
很可能对应于<sup>
和</sup>
。需要使用android应用程序作为参考点,使用regex替换来重建每个元素。
我能够使用安卓模拟器和lucky patcher绕过安卓应用保护。