How can I check whether a string looks random, or human-generated and pronounceable?

Shared by user 执念.

For the purpose of identifying [possible] bot-generated usernames.

Suppose you have a username like "bilbomoothof". It may be nonsense, but it still contains pronounceable sounds and so appears human-generated.

I accept that it could have been randomly generated from a dictionary of syllables, or word parts, but let's assume for a moment that the bot in question is a bit rubbish.

Suppose you have a username like "sdfgbhm342r3f". To a human this is clearly a random string. But can this be identified programmatically? Are there any algorithms available (similar to Soundex, etc.) that can identify pronounceable sounds within a string like this?

Solutions applicable in PHP/MySQL most appreciated.

Recommended answer

I guess you could come up with something like that if you restricted yourself to pronounceable sounds in English. For me (I am French), words like szczepan or wawrzyniec are unpronounceable and certainly look somewhat random.

But they are actually Polish first names (meaning Stephen and Lawrence)...
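To make that caveat concrete, here is a minimal sketch of an English-centric heuristic, written in Python for brevity (the same logic should port to PHP without much trouble). It is not an established algorithm like Soundex. The function names, the choice to treat "y" as a vowel, and both thresholds (`max_run=3`, `min_ratio=0.25`) are assumptions picked for illustration and would need tuning against real usernames.

```python
# Hypothetical heuristic: a string "looks pronounceable" (to an English
# speaker) if it has no long consonant cluster and enough vowels.
VOWELS = set("aeiouy")  # assumption: "y" counts as a vowel


def longest_consonant_run(s):
    """Length of the longest run of consecutive consonant letters."""
    longest = run = 0
    for ch in s.lower():
        if ch.isalpha() and ch not in VOWELS:
            run += 1
            longest = max(longest, run)
        else:
            # Vowels, digits and punctuation all break the run.
            run = 0
    return longest


def vowel_ratio(s):
    """Fraction of the letters in s that are vowels (0.0 if no letters)."""
    letters = [ch for ch in s.lower() if ch.isalpha()]
    if not letters:
        return 0.0
    return sum(ch in VOWELS for ch in letters) / len(letters)


def looks_pronounceable(s, max_run=3, min_ratio=0.25):
    """Assumed thresholds: at most 3 consonants in a row, at least 25% vowels."""
    return longest_consonant_run(s) <= max_run and vowel_ratio(s) >= min_ratio


for name in ("bilbomoothof", "sdfgbhm342r3f", "szczepan"):
    print(name, looks_pronounceable(name))
```

With these thresholds, "bilbomoothof" passes and "sdfgbhm342r3f" is rejected because of its seven-consonant run. "szczepan" is also rejected because of the "szcz" cluster, which is exactly the false positive described above: a check tuned to English sounds will misjudge legitimate names from other languages.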
