Removing duplicate rows from a concatenated table

Question

Removing duplicate rows from a concatenated table

I have the following sql query

SELECT m.School, c.avgscore 
FROM postswithratings c 
join ZEntrycriteria on c.fk_postID= m.schoolcode

Which gives the following result

School| avgscore
xyz   |  5
xyz   |  5
xyz   |  5
abc   |  3
abc   |  3
kkk   |  1

My question is how to remove these duplicates and only get the following.

 School| avgscore
    xyz   |  5 
    abc   |  3
    kkk   |  1

I tried with

   SELECT m.School, c.avgscore 
   FROM postswithratings c 
   join ZEntrycriteria on c.fk_postID= m.schoolcode 
   group by m.School

But it gives me the following error

"The column postswithratings.avgscore 'is not valid in the select list because it is not contained in either an aggregate function or a GROUP BY clause."

+3

sql sql-server

timton Mar 31 17 at 6:39

source to share

6 answers

You only need to add the following keyword: -

 SELECT DISTINCT  m.School, c.avgscore 
  FROM postswithratings c 
 join ZEntrycriteria on c.fk_postID= m.schoolcode

+1

mansi Mar 31 17 at 6:51

source to share

This will remove duplicate lines (duplicate only)

Scheme:

CREATE TABLE #TAB (School varchar(5) , avgscore int)
INSERT INTO #TAB
SELECT 'xyz', 5
UNION ALL
SELECT 'xyz', 5
UNION ALL
SELECT 'xyz', 5
UNION ALL
SELECT 'abc', 3
UNION ALL
SELECT 'abc', 3
UNION ALL
SELECT 'kkk', 1

Now use CTE as Tempprary View parameter and delete data.

;WITH CTE AS(
SELECT ROW_NUMBER() OVER (PARTITION BY School,avgscore  ORDER BY (SELECT 1)) DUP_C, 
School,  avgscore FROM #TAB
)

DELETE FROM CTE WHERE DUP_C>1

Now check #TAB, data will be

+--------+----------+
| School | avgscore |
+--------+----------+
| xyz    |        5 |
| abc    |        3 |
| kkk    |        1 |
+--------+----------+

0

Shakeer mirza Mar 31 17 at 6:42 am

source to share

CREATE TABLE #Table2
    ([School] varchar(3), [avgscore] int)

INSERT INTO #Table2
    ([School], [avgscore])
VALUES
    ('xyz', 5),
    ('xyz', 5),
    ('xyz', 5),
    ('abc', 3),
    ('abc', 3),
    ('kkk', 1)
;
SELECT  SCHOOL,AVGSCORE FROM (SELECT *,ROW_NUMBER() OVER( PARTITION BY [AVGSCORE] ORDER BY (SELECT NULL)) AS RN FROM #TABLE2)A
WHERE RN=1
ORDER BY AVGSCORE
-------
;WITH CTE AS
(SELECT *,ROW_NUMBER() OVER( PARTITION BY [AVGSCORE] ORDER BY (SELECT NULL)) AS RN FROM #TABLE2)
SELECT SCHOOL,AVGSCORE  FROM CTE WHERE RN=1

Output

SCHOOL  AVGSCORE
kkk      1
abc      3
xyz      5

0

Chanukya Mar 31 17 at 6:43

source to share

Using the DISTINCT keyword will allow using sql sets instead of multisets. Values appear only once

0

Tobias Frischknecht Mar 31 17 at 6:49 am

source to share

only the group is used if you are using an aggregate function eg. Maximum. sum, avg

in this case

SELECT Distinct(m.School), c.avgscore 
FROM postswithratings c 
join ZEntrycriteria on c.fk_postID= m.schoolcode

0

Jesse Mar 31 17 at 7:21

source to share

Georgi Raychev · Accepted Answer · 2017-03-31T06:44:08+0000

There is no need to complicate the situation. Just go with:

SELECT m.School, c.avgscore 
FROM postswithratings c 
join ZEntrycriteria on c.fk_postID= m.schoolcode 
group by m.School, c.avgscore

or

SELECT DISTINCT m.School, c.avgscore 
FROM postswithratings c 
join ZEntrycriteria on c.fk_postID= m.schoolcode

Removing duplicate rows from a concatenated table

More articles: