VS探测法基于Levenshtein距离的方法距离、方法、VS、Levenshtein

由网友(法海的女儿)分享简介:根据这评论在相关的主题,我想知道为什么Levenshtein距离为基础的方法比探测法更好。As per this comment in a related thread, I'd like to know why Levenshtein distance based methods are better than S...

根据这评论在相关的主题,我想知道为什么Levenshtein距离为基础的方法比探测法更好。

As per this comment in a related thread, I'd like to know why Levenshtein distance based methods are better than Soundex.

推荐答案

探测法是相当原始 - 它最初是为手算。它导致一个键可以比较的。

Soundex is rather primitive - it was originally developed to be hand calculated. It results in a key that can be compared.

探测法能够很好地处理西方的名字,因为它最初是为美国人口普查数据。它适用于语音比较。

Soundex works well with western names, as it was originally developed for US census data. It's intended for phonetic comparison.

Levenshtein距离着眼于两个值,并产生基于其相似性的值。它在寻找失踪或取代字母。

Levenshtein distance looks at two values and produces a value based on their similarity. It's looking for missing or substituted letters.

基本上探测法对于查找施密特和史密斯夫妇可能是同一姓氏更好。

Basically Soundex is better for finding that "Schmidt" and "Smith" might be the same surname.

Levenshtein距离是为察觉到用户已经输入错误Levnshtein更好; - )

Levenshtein distance is better for spotting that the user has mistyped "Levnshtein" ;-)

阅读全文

相关推荐

最新文章