一个有关DISTINCT的问题解答

2024-07-21 02:10:45

字体：大中小

来源：转载

供稿：网友

经常会有朋友问到类似于这样的问题，
表中的数据如下
id   aa   bb
1    123   456
1    4535 54
1    60     6564
1    60     656
2    50     664
2    60     6
3    89     89
4    40     4242
希望得到的结果是
id   aa   bb
1    123   456
2    50     664
3    89     89
4    40     4242
当然了，环境是sql server
解答及分析如下：
1，不少朋友希望用distinct就解决问题，但不可能，disctinct将重复的记录忽略，
但它忽略的是完全一致的重复记录，而不是其中某个字段重复的记录，所以也只有
这样的语法
select distinct id,aa,bb from tname
其它诸如select distinct(id),aa,bb from tname 或
select id,distinct aa,bb的写法都是无效的
2, 使用group by和聚合函数
select id,max(aa) as aa,max(bb) as bb from tname group by id
可以得到如下结果
id   aa   bb
1    4535 6564
2    60   664
3    89   89
4    40   4242
id是唯一了，但不一定后面的字段是同一条记录的
3, 使用临时表
select identity(int,1,1) as tid,id,aa,bb into #tmp from tname
select t1.id,t1.aa,t1.bb from #tmp t1 where t1.tid in
(select min(t2.tid) from #tmp t2 group by t2.id)
这样可以得到符合要求的结果
不过用了两个t-sql语句，
而且如果是大数据量的话，性能问题将很突出
到目前为止，我还没找到用一个t-sql语句实现同样功能的方法，
如果谁有，希望补充

上一篇：SQL技巧：创建用来按小时报告的查询

下一篇：如何提取除最新十条记录之外的所有记录？