
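Basic information:
- Patent title: A method, apparatus, and electronic device for identifying junk mail
- Patent title (English): Junk mail recognition method and device, and electronic equipment
- Application number: CN201710085329.6 Filing date: 2017-02-17
- Publication (announcement) number: CN108462624A Publication (announcement) date: 2018-08-28
- Inventor: Shen Chaoyang (沈朝阳)
- Applicant: Alibaba Group Holding Limited
- Applicant address: P.O. Box 847, 4th Floor, One Capital Place, Grand Cayman, Cayman Islands (British)
- Patentee: Alibaba Group Holding Limited
- Current patentee: Alibaba Group Holding Limited
- Current patentee address: P.O. Box 847, 4th Floor, One Capital Place, Grand Cayman, Cayman Islands (British)
- Agency: Beijing Qinghuayuan Law Firm (北京市清华源律师事务所)
- Agents: Shen Yong (沈泳); Wang Yongxiu (王永秀)
- Main classification: H04L12/58
- IPC classification: H04L12/58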
The invention discloses a junk mail recognition method comprising: extracting the text of a mail to be recognized and performing word segmentation on it to obtain an entry set for that mail; identifying noise characters in the entry set using a pre-acquired standard word frequency table, and computing the proportion of noise characters in the entry set; and judging whether that proportion exceeds a preset noise character proportion threshold and, if so, recognizing the mail as junk mail. The method identifies the noise characters in the mail from the features of noise characters, then decides whether the mail is junk based on the noise characters it contains; the implementation is simple and achieves comparatively high recognition accuracy.
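The three steps in the abstract (segment, measure the noise proportion, compare with a threshold) can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the abstract does not define how a noise character is detected, so the sketch assumes a token is noise when it is missing from the word frequency table or falls below a hypothetical `min_freq` floor. The regex tokenizer stands in for a real word segmenter, and `FREQ`, `min_freq`, and `threshold=0.5` are made-up values.

```python
import re

def segment(text):
    """Stand-in for word segmentation: lowercase runs of word characters.
    A real system would use a proper segmenter, especially for Chinese text."""
    return re.findall(r"\w+", text.lower())

def noise_ratio(tokens, freq_table, min_freq=1e-6):
    """Proportion of tokens treated as noise.
    Assumption: a token is noise if its frequency in the standard
    word frequency table is below min_freq (absent tokens count as 0)."""
    if not tokens:
        return 0.0
    noise_count = sum(1 for t in tokens if freq_table.get(t, 0.0) < min_freq)
    return noise_count / len(tokens)

def is_junk(text, freq_table, threshold=0.5, min_freq=1e-6):
    """Flag the mail as junk when the noise proportion exceeds the threshold."""
    return noise_ratio(segment(text), freq_table, min_freq) > threshold

# Hypothetical word frequency table (relative frequencies, made up for the demo).
FREQ = {
    "please": 0.002, "review": 0.001, "the": 0.05,
    "attached": 0.0008, "report": 0.001, "buy": 0.001, "now": 0.003,
}

if __name__ == "__main__":
    print(is_junk("Please review the attached report", FREQ))
    print(is_junk("buy now xq7z kkp0 zzv9 qwx", FREQ))
```

In the second sample mail, four of the six tokens are absent from the table, so the noise proportion is 4/6 and the mail is flagged under the assumed threshold of 0.5. In practice the threshold and frequency floor would need tuning against labeled mail.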
Publication/grant documents:
- CN108462624B Junk mail recognition method and device, and electronic equipment Publication/grant date: 2021-03-09