删除“重复对象”

假设我有一个来自同一个类的对象数组,这里有两个值得关注的属性:name和created_at。

如何在数组中找到具有相同名称的对象(被视为重复),然后删除数据库中的重复记录。 但是,具有最新created_at日期的对象是必须删除的对象。

seen = [] #sort by created date and iterate collection.sort({|a,b| a.created_at <=> b.created_at}).each do |obj| if seen.map(&:name).include? obj.name #check if the name has been seen already obj.destroy! else seen << obj #if not, add it to the seen array end end 

应该做好希望的工作。

如果在表上引入UNIQUE INDEX之前这只是一次性错误修正,那么您可以在SQL中执行此操作:

 DELETE FROM t WHERE id IN ( SELECT t1.id FROM t t1 LEFT JOIN t t2 ON t1.name = t2.name AND t2.created_at < t1.created_at WHERE t2.id IS NOT NULL )