HOW TO DELETE DUPLICATE ROWS IN SQL

In this section, we learn different ways to delete duplicate rows in MySQL and Oracle. If a SQL table contains duplicate rows, we need to remove them.

Preparing sample data

The script below creates a table named contacts.

 DROP TABLE IF EXISTS contacts;
 CREATE TABLE contacts (
   id INT PRIMARY KEY AUTO_INCREMENT,
   first_name VARCHAR(30) NOT NULL,
   last_name VARCHAR(25) NOT NULL,
   email VARCHAR(210) NOT NULL,
   age VARCHAR(22) NOT NULL
 );

We insert the following data into the table above.

 INSERT INTO contacts (first_name, last_name, email, age) VALUES
   ('Kavin', 'Peterson', '[email protected]', '21'),
   ('Nick', 'Jonas', '[email protected]', '18'),
   ('Peter', 'Heaven', '[email protected]', '23'),
   ('Michal', 'Jackson', '[email protected]', '22'),
   ('Sean', 'Bean', '[email protected]', '23'),
   ('Tom', 'Baker', '[email protected]', '20'),
   ('Ben', 'Barnes', '[email protected]', '17'),
   ('Mischa', 'Barton', '[email protected]', '18'),
   ('Sean', 'Bean', '[email protected]', '16'),
   ('Eliza', 'Bennett', '[email protected]', '25'),
   ('Michal', 'Krane', '[email protected]', '25'),
   ('Peter', 'Heaven', '[email protected]', '20'),
   ('Brian', 'Blessed', '[email protected]', '20'),
   ('Kavin', 'Peterson', '[email protected]', '30');

We can run this script again to recreate the test data after executing a DELETE statement.

The following query returns the data from the contacts table:


 SELECT * FROM contacts ORDER BY email;

id	first_name	last_name	email	age
7	Ben	Barnes	[email protected]	21
13	Brian	Blessed	[email protected]	18
10	Eliza	Bennett	[email protected]	23
1	Kavin	Peterson	[email protected]	22
14	Kavin	Peterson	[email protected]	23
8	Mischa	Barton	[email protected]	20
11	Michal	Krane	[email protected]	17
4	Michal	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
3	Peter	Heaven	[email protected]	25
12	Peter	Heaven	[email protected]	25
5	Sean	Bean	[email protected]	20
9	Sean	Bean	[email protected]	20
6	Tom	Baker	[email protected]	30

The following SQL query returns the duplicate emails from the contacts table:

 SELECT email, COUNT(email) FROM contacts GROUP BY email HAVING COUNT(email) > 1;

email	COUNT(email)
[email protected]	2
[email protected]	2
[email protected]	2

Three email addresses appear more than once, each in two rows.
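
The count above tells us which emails repeat, but not which rows carry them. A minimal sketch, assuming the contacts table defined earlier, that lists every row whose email appears more than once (the aliases c and d are only illustrative):

```sql
-- List the full rows that share an email with at least one other row.
SELECT c.id, c.first_name, c.last_name, c.email
FROM contacts c
INNER JOIN (
    -- emails that occur in more than one row
    SELECT email
    FROM contacts
    GROUP BY email
    HAVING COUNT(*) > 1
) d ON d.email = c.email
ORDER BY c.email, c.id;
```

Seeing the ids side by side makes it easier to decide which copy of each duplicate to keep before deleting anything.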

(A) Delete duplicate rows using the DELETE JOIN statement

 DELETE t1 FROM contacts t1
 INNER JOIN contacts t2
 WHERE t1.id < t2.id AND t1.email = t2.email;

Output:

 Query OK, 3 rows affected (0.10 sec)

Three rows were deleted. We run the query below to look for duplicate emails in the table.

 SELECT email, COUNT(email) FROM contacts GROUP BY email HAVING COUNT(email) > 1;

The query returns an empty set. To verify the data in the contacts table, execute the following SQL query:

 SELECT * FROM contacts;

id	first_name	last_name	email	age
7	Ben	Barnes	[email protected]	21
13	Brian	Blessed	[email protected]	18
10	Eliza	Bennett	[email protected]	23
14	Kavin	Peterson	[email protected]	23
8	Mischa	Barton	[email protected]	20
11	Michal	Krane	[email protected]	17
4	Michal	Jackson	[email protected]	18
2	Nick	Jonas	[email protected]	16
12	Peter	Heaven	[email protected]	25
9	Sean	Bean	[email protected]	20
6	Tom	Baker	[email protected]	30

The condition t1.id < t2.id matches every row that has a duplicate with a higher id, so the rows with ids 1, 3, and 5 have been deleted and the row with the highest id is kept for each email. To keep the row with the lowest id instead, we use the statement below.

First run the script again to recreate the contacts table.

 DELETE c1 FROM contacts c1
 INNER JOIN contacts c2
 WHERE c1.id > c2.id AND c1.email = c2.email;

id	first_name	last_name	email	age
1	Ben	Barnes	[email protected]	21
2	Kavin	Peterson	[email protected]	22
3	Brian	Blessed	[email protected]	18
4	Nick	Jonas	[email protected]	16
5	Michal	Krane	[email protected]	17
6	Eliza	Bennett	[email protected]	23
7	Michal	Jackson	[email protected]	18
8	Sean	Bean	[email protected]	20
9	Mischa	Barton	[email protected]	20
10	Peter	Heaven	[email protected]	25
11	Tom	Baker	[email protected]	30
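
A DELETE JOIN removes rows immediately, so it can help to preview the affected rows first. The sketch below is not part of the original script. It assumes the contacts table uses a transactional storage engine such as InnoDB, so that the delete can be rolled back, and it uses an explicit ON clause, which is equivalent to the WHERE form shown above.

```sql
START TRANSACTION;

-- Preview: rows that the keep-lowest-id DELETE would remove.
SELECT DISTINCT c1.id, c1.email
FROM contacts c1
INNER JOIN contacts c2
    ON c1.email = c2.email AND c1.id > c2.id;

-- The delete itself, with the same join condition as the preview.
DELETE c1
FROM contacts c1
INNER JOIN contacts c2
    ON c1.email = c2.email AND c1.id > c2.id;

-- Inspect the result, then undo it (or COMMIT to keep it).
SELECT * FROM contacts ORDER BY email;
ROLLBACK;
```

DISTINCT matters in the preview when an email occurs three or more times, because the self-join then pairs the same row with several lower ids.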

(B) Delete duplicate rows using an intermediate table

To delete duplicate rows using an intermediate table, follow the steps below:


Step 1. Create a new table with the same structure as the original table:

 CREATE TABLE source_copy LIKE source;

Step 2. Insert the distinct rows from the original table into the new table:

 INSERT INTO source_copy SELECT * FROM source GROUP BY col;

Step 3. Drop the original table and rename the intermediate table to the original name.

 DROP TABLE source; ALTER TABLE source_copy RENAME TO source;

For example, the following statements delete the rows with duplicate emails from the contacts table:

 -- step 1
 CREATE TABLE contacts_temp LIKE contacts;
 -- step 2
 INSERT INTO contacts_temp SELECT * FROM contacts GROUP BY email;
 -- step 3
 DROP TABLE contacts;
 ALTER TABLE contacts_temp RENAME TO contacts;
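
The GROUP BY form in step 2 relies on MySQL accepting selected columns that are neither aggregated nor listed in the GROUP BY clause. With the ONLY_FULL_GROUP_BY SQL mode enabled, which is the default since MySQL 5.7, that INSERT is rejected. A possible variant, sketched here under the assumption that we want to keep the row with the lowest id for each email, selects the surviving rows explicitly:

```sql
-- step 1: empty copy of the table structure
CREATE TABLE contacts_temp LIKE contacts;

-- step 2: copy only the lowest-id row for each email
INSERT INTO contacts_temp
SELECT *
FROM contacts
WHERE id IN (SELECT MIN(id) FROM contacts GROUP BY email);

-- optional check: both counts should match before the original is dropped
SELECT COUNT(*) FROM contacts_temp;
SELECT COUNT(DISTINCT email) FROM contacts;

-- step 3: replace the original table
DROP TABLE contacts;
ALTER TABLE contacts_temp RENAME TO contacts;
```

Using MAX(id) in step 2 would keep the newest row for each email instead.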

(C) Delete duplicate rows using the ROW_NUMBER() function

Note: the ROW_NUMBER() function has been supported since MySQL version 8.0.2, so we should check our MySQL version before using it.

The following statement uses ROW_NUMBER() to assign a sequential integer to each row within its group of identical emails. The first row of a group gets number 1, so any row numbered higher than one is a duplicate.

 SELECT id, email,
   ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
 FROM contacts;

The following SQL query returns the list of ids of the duplicate rows:


 SELECT id
 FROM (
   SELECT id,
     ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
   FROM contacts
 ) t
 WHERE row_num > 1;

Output:

id
9
12
14
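
The id list above can be fed straight into a DELETE. The statement below is a sketch, assuming MySQL 8.0.2 or later and the contacts table from this section. The window query stays wrapped in the derived table t so that MySQL materializes it before deleting from the same table.

```sql
-- Delete every row that is not the first (lowest id) in its email group.
DELETE FROM contacts
WHERE id IN (
    SELECT id
    FROM (
        SELECT id,
               ROW_NUMBER() OVER (PARTITION BY email ORDER BY id) AS row_num
        FROM contacts
    ) t
    WHERE row_num > 1
);
```

Changing the window to ORDER BY id DESC would keep the highest id for each email instead.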

Delete duplicate records in Oracle

When we find duplicate records in a table, we need to delete the unwanted copies to keep our data clean and unique. If a table has duplicate rows, we can delete them using the DELETE statement.

In this case, we have a column that is not part of the group of columns used to identify the duplicate records in the table.

Consider the table below:

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
02	Potato	Brown
03	Onion	Red
04	Onion	Red
05	Onion	Red
06	Pumpkin	Green
07	Pumpkin	Yellow

 -- create the vegetable table
 CREATE TABLE vegetables (
   VEGETABLE_ID NUMBER GENERATED BY DEFAULT AS IDENTITY,
   VEGETABLE_NAME VARCHAR2(100),
   color VARCHAR2(20),
   PRIMARY KEY (VEGETABLE_ID)
 );

 -- insert sample rows
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Potato', 'Brown');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Potato', 'Brown');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Pumpkin', 'Green');
 INSERT INTO vegetables (VEGETABLE_NAME, color) VALUES ('Pumpkin', 'Yellow');

 -- query data from the vegetable table
 SELECT * FROM vegetables;
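
Before deleting anything, we may want to see which combinations of VEGETABLE_NAME and color occur more than once. A small sketch, assuming the vegetables table above (the alias copies is only illustrative):

```sql
-- Groups of rows that share the same name and color.
SELECT VEGETABLE_NAME, color, COUNT(*) AS copies
FROM vegetables
GROUP BY VEGETABLE_NAME, color
HAVING COUNT(*) > 1
ORDER BY VEGETABLE_NAME;
```

VEGETABLE_ID is left out of the GROUP BY on purpose: it differs in every row, so including it would hide the duplicates.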

Suppose we want to keep the row with the highest VEGETABLE_ID and delete all other copies.

 SELECT MAX(VEGETABLE_ID)
 FROM vegetables
 GROUP BY VEGETABLE_NAME, color
 ORDER BY MAX(VEGETABLE_ID);

MAX(VEGETABLE_ID)
2
5
6
7

We use the DELETE statement to delete the rows whose VEGETABLE_ID value is not the highest in its group.


 DELETE FROM vegetables
 WHERE VEGETABLE_ID NOT IN (
   SELECT MAX(VEGETABLE_ID)
   FROM vegetables
   GROUP BY VEGETABLE_NAME, color
 );

Three rows have been deleted.

 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
02	Potato	Brown
05	Onion	Red
06	Pumpkin	Green
07	Pumpkin	Yellow

If we want to keep the row with the lowest id, we use the MIN() function instead of the MAX() function.

 DELETE FROM vegetables
 WHERE VEGETABLE_ID NOT IN (
   SELECT MIN(VEGETABLE_ID)
   FROM vegetables
   GROUP BY VEGETABLE_NAME, color
 );

The method above works if we have a column that is not part of the group used to identify duplicates. If the values in all columns are duplicated, we cannot use the VEGETABLE_ID column.

Let's drop the vegetables table and recreate it with a new structure.

 DROP TABLE vegetables;
 CREATE TABLE vegetables (
   VEGETABLE_ID NUMBER,
   VEGETABLE_NAME VARCHAR2(100),
   color VARCHAR2(20)
 );

 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (1, 'Potato', 'Brown');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (1, 'Potato', 'Brown');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2, 'Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2, 'Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (2, 'Onion', 'Red');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (3, 'Pumpkin', 'Green');
 INSERT INTO vegetables (VEGETABLE_ID, VEGETABLE_NAME, color) VALUES (4, 'Pumpkin', 'Yellow');
 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
01	Potato	Brown
02	Onion	Red
02	Onion	Red
02	Onion	Red
03	Pumpkin	Green
04	Pumpkin	Yellow

In the vegetables table, the duplicate rows now have identical values in all columns: VEGETABLE_ID, VEGETABLE_NAME, and color.

We can use the rowid, a locator that specifies where Oracle stores the row. Because the rowid is unique for each row, we can use it to remove the duplicate rows.

 DELETE FROM vegetables
 WHERE rowid NOT IN (
   SELECT MIN(rowid)
   FROM vegetables
   GROUP BY VEGETABLE_ID, VEGETABLE_NAME, color
 );

The following query verifies the delete operation:

 SELECT * FROM vegetables;

VEGETABLE_ID	VEGETABLE_NAME	COLOR
01	Potato	Brown
02	Onion	Red
03	Pumpkin	Green
04	Pumpkin	Yellow
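
The rowid technique can also be combined with ROW_NUMBER(), which Oracle supports as an analytic function. The statement below is an alternative sketch, not the method used above. It assumes the second version of the vegetables table, and the aliases rid and rn are illustrative.

```sql
-- Number the rows inside each group of identical values and
-- delete every row after the first one, identified by its rowid.
DELETE FROM vegetables
WHERE rowid IN (
    SELECT rid
    FROM (
        SELECT rowid AS rid,
               ROW_NUMBER() OVER (
                   PARTITION BY VEGETABLE_ID, VEGETABLE_NAME, color
                   ORDER BY rowid
               ) AS rn
        FROM vegetables
    )
    WHERE rn > 1
);
```

Run against the table with duplicates, this targets the same rows as the MIN(rowid) version, since both keep the row with the lowest rowid in each group. The ORDER BY inside the window can be swapped for another column when a different copy should survive.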
