首页 > 编程语言> > python – 如何推荐下一个成就

python – 如何推荐下一个成就

2019-05-22 12:42:35 作者：互联网

精简版：

我有类似StackOverflow的设置.用户获得成就.我有比SO更多的成就,让我们说10k的顺序,每个用户都有100多个成就.现在,您如何推荐(推荐)用户尝试的下一个成就？

长版：

在django中对象建模如下(仅显示重要部分)：

class User(models.Model):
    alias = models.ForeignKey(Alias)

class Alias(models.Model):
    achievements = models.ManyToManyField('Achievement', through='Achiever')

class Achievement(models.Model):
    points = models.IntegerField()

class Achiever(models.Model):
    achievement = models.ForeignKey(Achievement)
    alias = models.ForeignKey(Alias)
    count = models.IntegerField(default=1)

我的算法只是找到与登录用户共享成就的每个其他用户,然后完成所有成就并按出现次数排序：

def recommended(request) :
    user = request.user.get_profile()

    // The final response
    r = {}

    // Get all the achievements the user's aliases have received 
    // in a set so they aren't double counted
    achievements = set()
    for alias in user.alias_set.select_related('achievements').all() :
        achievements.update(alias.achievements.all())

    // Find all other aliases that have gotten at least one of the same
    // same achievements as the user
    otherAliases = set()
    for ach in achievements :
        otherAliases.update(ach.alias_set.all())

    // Find other achievements the other users have gotten in addition to
    // the shared ones.
    // And count the number of times each achievement appears
    for otherAlias in otherAliases :
        for otherAch in otherAlias.achievements.all() :
            r[otherAch] = r.get(otherAch, 0) + 1

    // Remove all the achievements that the user has already gotten
    for ach in achievements :
        r.pop(ach)

    // Sort by number of times the achievements have been received
    r = sorted(r.items(), lambda x, y: cmp(x[1], y[1]), reverse=True)

    // Put in the template for showing on the screen
    template_values = {}
    template_values['achievements'] = r

但它需要FOREVER运行,并且总是返回整个列表,这是不需要的.用户只需要前几个成就.

所以,我欢迎提出有关其他算法和/或代码改进的建议.我会在系统中为您提供推荐算法的成就:)

解决方法:

您可以推荐的一种方法是查看有多少用户已经拥有这些成就并推荐那些受欢迎的成就.当他们取得了这些成绩后,你就可以进入名单,并推荐一些不太受欢迎的名单然而,这有一种天真的假设,即每个人都想要获得流行的成就.它可能会使流行的成就变得更受欢迎,而不那么受欢迎的成就……嗯……一个安慰是,它不会占用太多资源,而且可能会非常快速地运行. (只需保留成就列表的次数)

另一种方法(试图根据他已经取得的成就来猜测用户可能会追求的成就)是使用一些机器学习算法.我认为k-nearest neighbor algorithm在这里表现相当不错.选择一个阈值,然后输出高于此阈值的所有内容.现在,我不知道这是否会比你已经拥有的更快,但是每次用户取得新的成就时,你应该只运行一次推荐引擎,存储顶部(比方说)五,然后输出它每当需要推荐时返回给用户.

我希望这有帮助. =)

标签：python,optimization,django,achievements
来源： https://codeday.me/bug/20190522/1151809.html