Collins English Dictionary 20231010

Versions

version 1

  1. download
    1. mdx: https://cloud.freemdict.com/index.php/s/EsZsiH7wXLgGHFJ
    2. css: ced23.css (4.2 KB)
  2. snapshot
  3. 191817 available hwds of CED part from https://www.collinsdictionary.com/
  4. Collins unavailable words:
    1. existed in wordlist but removed then ( can be recovered from dictionary.com or other old Collins data )
      https://www.collinsdictionary.com/dictionary/english/little-russia
      https://www.collinsdictionary.com/dictionary/english/little-russian
      https://www.collinsdictionary.com/dictionary/english/uppitiness
      https://www.collinsdictionary.com/dictionary/english/uppity
      https://www.collinsdictionary.com/dictionary/english/uppityness
    2. existed CED tag but empty content:
      1. id 127047
      2. id 268317
  5. todo features:
    1. inner links
    2. audio
    3. css
    4. hwd forms

version 2 wip

The hwd is not extracted right to the content:

  1. e.g. https://www.collinsdictionary.com/dictionary/english/bacterial-community-composition
    1. Its hwd is not bacterial-community-composition
    2. Its hwd is not bacterial
    3. Its hwd is bacterial community composition

version 2: CED_231010_v231014.7z

  1. download:
    1. freemdict cloud: FreeMdict Cloud
    2. baidu pan: https://pan.baidu.com/s/10i67e9lfIHUOKOTVToTtyA?pwd=free
  2. snapshot:
  3. data version: 20231010
  4. mdx version: 20231014
    1. offline audio of word pronunciations
    2. internal links for common words
    3. css for common structures
    4. non-stand-alone entry removed
  5. todo
    1. word forms(won’t init until reprocess the whole Collins online(CCALD+CED+WNWD+RHU…)
    2. speed optimization
      1. hard-coded the style(deletion, lable, structure…) with pure HTML

FAQ

1. unrendered unicode char

It is a Unicode character that is not in your default installed fonts on your device.

Two methods:

  1. Install a large font to your system:
    e.g. 阿里推出符合GB 18030-2022中文编码字符集新标准的免费字体 select the largest one or 下載頁 .
  2. or install the font to CSS file:
    reference: font - CSS: Cascading Style Sheets | MDN

I didn’t copy the official Collins fonts into mdx for

  1. reducing complexity when making mdx
  2. reducing rendering time when loading html(and its css or font)
16 个赞

一本词典修完,英语水平也暴涨了吧?

2 个赞

https://cloud.freemdict.com/index.php/s/d7PHXYLJDJTmxQ9

这个链接失效了,能否补一下?

下不了,打不开。朋友们都用什么浏览器啊?

1 个赞

6哥你总是水论坛和我聊天,发css,难得你做了个词典完全没人气 :grin:

这版词典css外皮还原官网的ui了吗

感谢分享,坚持。

1 个赞

Thank you for your kindness to upload it again onto another workable platform! I’ll wait with patience and appreciation.

1 个赞

bug report: erroneous presentation of superscript ə in mdict android app.
Screenshot_20231115-150229_1
It should be:
Screenshot_20231115-150331_Vivaldi_Browser_Snapshot_1

1 个赞

Another bug here regarding ӯ
Screenshot_20231115-220313_MDict_1
It should be:
Screenshot_20231115-220346_Vivaldi_Browser_Snapshot_1

Are the errors caused by the mdict app or by the files? I can still copy and paste the letters despite the garbled appearance. @6lj6

修改了两点:
1、将紧凑的布局改为列表方式
2、清理mdx中没用的标签内容,修复JS错误与资源加载错误

显示效果:

7 个赞

6兄审美在线!关键是有离线语音,麦克米伦都关了,很担心柯林斯。坛子里A大的虽然也好用,但是没有离线发音。
6兄可否把同义词也整合进来,全面离线

可以分享改后的文件吗,谢谢

2 个赞

一年没上坛子了,才看到。来晚了。

1 个赞

惊艳啊,感谢!

感谢您抓取数据,希望根据数据制作最新的Collins COBUILD Advanced Learner’s Dictionary版本,其中CEFR分级和例句对于学习者帮助非常大。

1 个赞


最新的其实是有CEFR词汇分级标签的,楼主没放出来还是没抓取到,可能楼主的不是最新的Collins english dictionary

1 个赞

楼主抓取的其实是Collins English Dictionary,并且是其中的British English,并不是柯林斯高阶,如图:

The Collins website hosts a variety of dictionaries on the same page, such asLeaner’s Dictionary, CED, AED, etc. If a page is retrieved, data can be parsed for these dictionaries respectively.

1 个赞

看来没人跟帖,看来大家都不需要,怪不得没人肯抓
柯林斯不吃香了