O X ten raw_data 整理

数据还原,源数据在二楼,共40970条,

  • 一层的 40972 则包括相同值(仅一条值)的多行记录 + key: {} 首尾两行,即40970,满且相同。
  • “id” 122910 则包括不同值(多条不同值)的一行记录,即 40970 * 3,满且唯一。
  • “word” 122790 则 40930 * 3,有重复。
      1 {
      2     "data": {            
      3         "o10dict": {
      4             "id": {
      5 +---122910 lines: "u596c17338875400e.30be04e6.154e2615987.3466": {······································································································
 122915             },
 122916             "word": {
 122917 +---122790 lines: "-ie": {··············································································································································
 245707             },
 245708             "word_body": {
 245709 +---8973718 lines: "o10dict": {·········································································································································
9219427             }
9219428         }
9219429     },
9219430     "status_code": {
9219431 +--40972 lines: "0": {··················································································································································
9260403     },
9260404     "message": {
9260405 +--40972 lines: "\u6210\u529f": {·······································································································································
9301377     }
9301378 }