cancel
Showing results for 
Search instead for 
Did you mean: 
Reply

Delete duplicates in a SharePoint List

Hello all,

I have got a SharePoint List with about 1400 items. It only should be arround 800 items and I found out that many items have duplicates. I don´t know why and that´s also not the point here.
My idea is to create a Flow which will get each row of the list and goes through all other rows and checks if *every* column is equal. If yes then delete this row.

I found this here in the forum:
Solved: Delete Duplicate in SharePoint List - Power Platform Community (microsoft.com)
But this is not exactly what I need I think.
I created this flow (unfortunately it is german) - only 3 conditions as an example. In reallity there are 8 columns.

 
 

Screenshot 2021-01-18 115100.jpg


I did not run it yet as I am pretty sure that this Flow will delete everything - or am I wrong?
How can I achieve that the flow deletes only duplicates but leaves one entry of each?

Thank you in advance!

1 ACCEPTED SOLUTION

Accepted Solutions
fchopo
Super User
Super User

Hello @der_cosmicyes 

In your solution you should another apply to each. I mean, when do the comparison, you are comparing the values of the same item, so it will be always true, and therefore, it will delete all items.

You need an algorithm like that:

1) Get ALL items.

2) For each item.

3) Get ALL items AGAIN

4) For each item in step 3, compare it against the item in step 2.

5) If the values are the same, remove it.

If you find the use of OData filters too complicated, you could do something like this:

getitems.png

 

Hope it helps!

Ferran

 

Did I answer your question? Please consider to mark my post as a solution to help others.
Proud to be a Flownaut!

View solution in original post

30 REPLIES 30
fchopo
Super User
Super User

Hello @der_cosmicyes 

You could do the following:

1) Get all items.

2) For each item.

3) Look for an identical one, using get items and an Odata Filter query expression.

4) If more than one element is found (check the lenght of the ouput of the previous step) -> there is a duplicate.

5) Delete the item.

Have a look at this example:

filterQuery2.png

 

Therefore, in your "Get Items 2" action, you should write an OData Filter query with the 8 fields to compare.

 

Have a look at this sources on how to work with this kind of expressions: 

Every Power Automate (MS Flow) Filter Query You Ever Wanted To Know As A Functional Consultant – DIY...

OData Filter query in SharePoint Get items in Flow (sharepains.com)

Hope it helps!

Ferran

Did I answer your question? Please consider to mark my post as a solution to help others.
Proud to be a Flownaut!

Thank you Ferran,
Thank you! This sounds a little complicated to a PowerAutomate Newbie. I haven´t heard of OData Filters but I am willing to dive deeper into it.
But one question:
Is there no way to combine "my" solution with the idea to check if more than 1 item is found?
I think my solution would delete everything but if I can check if more than 1 one was found this could be the solution. What do you think?

fchopo
Super User
Super User

Hello @der_cosmicyes 

In your solution you should another apply to each. I mean, when do the comparison, you are comparing the values of the same item, so it will be always true, and therefore, it will delete all items.

You need an algorithm like that:

1) Get ALL items.

2) For each item.

3) Get ALL items AGAIN

4) For each item in step 3, compare it against the item in step 2.

5) If the values are the same, remove it.

If you find the use of OData filters too complicated, you could do something like this:

getitems.png

 

Hope it helps!

Ferran

 

Did I answer your question? Please consider to mark my post as a solution to help others.
Proud to be a Flownaut!

Thank you Ferran,
I have not tried yet but this seems very good to me!
Espacially the condition ID(1) is NOT equal to ID(2) is a very good idea.
I will try as soon as possible and then Accept as Solution (I hope 😉 )

Best regards

Hi Ferran,
ok, I know why you wanted me to use OData Query: Performance! 🙂
Unfortunately I could not solve it using OData.
I am using now your second approach (which is really very slow...) but it works.
The only thing I changed ist to use ID of 'Get Items' and not of 'Get Items 2':

I set up a very small list with Items
* Paul McCartney
* John Lennon
* George Harrison
* Ringo Starr
* Paul McCartney

and used your second approach.
When deleting ID of 'Get Items 2' both Pauls are gone and the Beatles are a Trio 😉
Using ID of 'Get Items' worked.
You helped me very much - thank you!

@fchopo how would you remove all but one? I want to remove duplicates but keep one.

Anonymous
Not applicable

Same here! I need to remove all items but one. How can this be achieved?

 

Thank you!

Anonymous
Not applicable

I solved this recently. I have an automation that finds duplicates from a Sharepoint Online List and removes duplicates, leaving the original one.

 

The start of the flow is the same as the example before, so you use Odata to filter the items and check that the length of the results is more than 1. There are some changes I made after that and one change at the beginning of the flow. I created a variable. The variable contains the Sharepoint List data and the variable is updated after each filtering. At the end of the flow, I extract the List item ID's from the Array and use those ID's in the "Delete item" part. My flow is in finnish, but i'll try to explain with a picture:

automaatio.png

 

Hope this helps!

Thank you on your sharing.

Would it be okay for your to share more detail how the ODATA FILTERED ITEMS configured? 

 

I've tried, but it does not work. I'm new in Power automate, I wish to learn more how it works to improve the performance. 

Anonymous
Not applicable

Sure!

 

Select the "Get items" step, and open the advanced settings. Then type the code in the Odata filtering field. My code is below, but you can use whatever suits your needs. The field names are case sensitive, so keep that in mind.

 

odatafilter.PNG

 

Does this answer your question?

Thank you @Anonymous . 

 

I would like to learn the logic. Would it be okay to get your input from few items below: 

  1. From the benksyong_1-1656913167952.png

    if Title has to match with Sharepoint List column's header, then, 'benksyong_0-1656912328677.png' is refer in Dynamic content with same column header? 

  2. During Odata filtering Dynamic content selection, should I refer to 'Get items' or 'Get items 2'? why? what logic behind?
  3. If I've 8 columns in my SharePoint List that I need to match with logic if all 8 column contents are identical, then consider the line item is duplicated. In this case, if that means i need to repeat all 8 column entry in OData filtering with example,
    • Title eq 'Title' AND column2 SP header eq 'Dynamic content: column2' AND column3 SP header eq 'Dynamic content: column3' AND....... column 8 SP header eq 'Dyanamic contect: column 8' 
  4. if the 'Create the variable' is necessary step to define which specific item should be deleted?

 

 

Anonymous
Not applicable

 

I'll go trough this step by step.

 

The first step in the flow is to get the items without any filtering. Let's say that's called "Get items 1". You put the result of that into a variable (because you need to define the variable at the start of the flow) and let's say the variable is called YourVariable:

 

ilied84_0-1656914522007.png

 

Then you add the "apply to all" step, and under that you create the Odata filtered get items -step. Let's call that "Get items 2". The logic in the Odata filtering is that you are comparing two lists of items. So basically "If the title of get items 2 is equal to a title in get items 1 and Endtime in get items 2 is equal to Endtime in get items 1". This of course returns all the values, but it groups the items based on that Odata filtering. So you can have an item with one result, or an item with 4 results, or however many duplicates that filtering can find.

 

After the filtering is done, you update the variable (YourVariable) you created at the start of the flow with the results of the filtering. Then you create an if-statement step in which you check the "length" of the variable items. Length in this case means, that how many results does one item contain. When you add the if-statement, add the code:
Length(variables('YourVariable')) and select "is greater than 1" from the dropdown menu. This skips all the results that are not duplicates.

 

ilied84_1-1656915429173.png

 

The next step is "Compose". What we are doing here is a step, that skips the first result of YourVariable after the length has been checked. At this point all the non-duplicate entries are not present and we are left with ONLY the duplicates, we need to skip the first result, so that there will be one result left in the sharepoint online list. The code for the Compose -step is: skip(variables('YourVariable'),1)

 

Now it's time to update YourVariable again with the results of the Compose -step.

 

The final step is to get all the ID-numbers from YourVariable and delete the corresponding items from the Sharepoint Online list. You add the "delete items" step and in the item ID field you add the following code:

variables('YourVariable')[0]['ID']

 

This finds all the ID's and removes the list items based on those values. I'm not completely sure about the [0], but I think it removes null values, if there are any. I found this in an example and it was not explained. Anyway, the Flow works very well in my case.

 

ilied84_2-1656916536421.png

 

hi,  I ran into problem as below step: Column 'status does not exist'

benksyong_0-1657034276035.png

 

but the Status column indeed remained in my SharePoint List, and my Odata filter entry: 

benksyong_1-1657034489489.png

benksyong_2-1657034607802.png

 

With example data in SharePoint list, the duplicated item is 

Title = a AND Status = aaa. 

 

Expected result post-automate is to remove one of the item named Title= a AND Status = aaa. ...still can't get it right. seek advise. 

 

Regards,

Yong

Thanks for sharing this.

I thought it was working beautifully until I realized it's creating duplicates for each array and deleting them.

My list, which was originally at 2k (no duplicates because the other flow only run once), now only has 1 row...

Any suggestions? I followed your step exactly.

Thanks for this solution! Have you figured out a quicker version of this? My version takes more than 2 minutes to delete a duplicate entry.

Here's the flow I built:
(Overview)

kymramos_0-1657456411463.png

 

(First pair of "get items from SharePoint" + "apply action if condition was met")

kymramos_1-1657456500793.png

 

(Second pair)

kymramos_2-1657456961763.png

 

(Third and last pair)

*A replicate of the first pair, but instead, referencing the version of the SharePoint list (product of the series of Japanese-English translations).

 

New data gets loaded into the query (Power BI) whenever it gets refreshed, and so whenever the button is clicked, I think Power Automate always starts from the very first row of data and hence the duplicate entries generated. Is there a way to tell Power Automate to only create items that are not currently in the SharePoint list so that it won't have to generate duplicate entries?

 

I hope my explanation was clear!

Please let me know if you'd need me to expand.

 

Thanks a bunch!

Hi,

I used this method, but I have a problem when a title has an apostrophe in it. It breaks the ODATA query.

 

Do you have an idea of an expression that would not break the "Get the items"?

Thanks by advance

it doesn't work

@fchopo 

Hi,

 

you need to check what name/code "Get the items" creates for you columns.

The link below shows you, where you can find the correct spelt name of your columns (Post from: ScottShearer 08-16-2019 09:56 AM)


https://powerusers.microsoft.com/t5/General-Power-Automate/OData-filter-column-with-space/td-p/34241...

 

Does it help?

 

Regards

Helpful resources

Announcements

Community will be READ ONLY July 16th, 5p PDT -July 22nd

Dear Community Members,   We'd like to let you know of an upcoming change to the community platform: starting July 16th, the platform will transition to a READ ONLY mode until July 22nd.   During this period, members will not be able to Kudo, Comment, or Reply to any posts.   On July 22nd, please be on the lookout for a message sent to the email address registered on your community profile. This email is crucial as it will contain your unique code and link to register for the new platform encompassing all of the communities.   What to Expect in the New Community: A more unified experience where all products, including Power Apps, Power Automate, Copilot Studio, and Power Pages, will be accessible from one community.Community Blogs that you can syndicate and link to for automatic updates. We appreciate your understanding and cooperation during this transition. Stay tuned for the exciting new features and a seamless community experience ahead!

Summer of Solutions | Week 4 Results | Winners will be posted on July 24th

We are excited to announce the Summer of Solutions Challenge!    This challenge is kicking off on Monday, June 17th and will run for (4) weeks.  The challenge is open to all Power Platform (Power Apps, Power Automate, Copilot Studio & Power Pages) community members. We invite you to participate in a quest to provide solutions to as many questions as you can. Answers can be provided in all the communities.    Entry Period: This Challenge will consist of four weekly Entry Periods as follows (each an “Entry Period”)   - 12:00 a.m. PT on June 17, 2024 – 11:59 p.m. PT on June 23, 2024 - 12:00 a.m. PT on June 24, 2024 – 11:59 p.m. PT on June 30, 2024 - 12:00 a.m. PT on July 1, 2024 – 11:59 p.m. PT on July 7, 2024 - 12:00 a.m. PT on July 8, 2024 – 11:59 p.m. PT on July 14, 2024   Entries will be eligible for the Entry Period in which they are received and will not carryover to subsequent weekly entry periods.  You must enter into each weekly Entry Period separately.   How to Enter: We invite you to participate in a quest to provide "Accepted Solutions" to as many questions as you can. Answers can be provided in all the communities. Users must provide a solution which can be an “Accepted Solution” in the Forums in all of the communities and there are no limits to the number of “Accepted Solutions” that a member can provide for entries in this challenge, but each entry must be substantially unique and different.    Winner Selection and Prizes: At the end of each week, we will list the top ten (10) Community users which will consist of: 5 Community Members & 5 Super Users and they will advance to the final drawing. We will post each week in the News & Announcements the top 10 Solution providers.  At the end of the challenge, we will add all of the top 10 weekly names and enter them into a random drawing.  Then we will randomly select ten (10) winners (5 Community Members & 5 Super Users) from among all eligible entrants received across all weekly Entry Periods to receive the prize listed below. If a winner declines, we will draw again at random for the next winner.  A user will only be able to win once overall. If they are drawn multiple times, another user will be drawn at random.  Individuals will be contacted before the announcement with the opportunity to claim or deny the prize.  Once all of the winners have been notified, we will post in the News & Announcements of each community with the list of winners.   Each winner will receive one (1) Pass to the Power Platform Conference in Las Vegas, Sep. 18-20, 2024 ($1800 value). NOTE: Prize is for conference attendance only and any other costs such as airfare, lodging, transportation, and food are the sole responsibility of the winner. Tickets are not transferable to any other party or to next year’s event.   ** PLEASE SEE THE ATTACHED RULES for this CHALLENGE**   Week 1 Results: Congratulations to the Week 1 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge.   Community MembersNumber SolutionsSuper UsersNumber Solutions Deenuji 9 @NathanAlvares24  17 @Anil_g  7 @ManishSolanki  13 @eetuRobo  5 @David_MA  10 @VishnuReddy1997  5 @SpongYe  9JhonatanOB19932 (tie) @Nived_Nambiar  8 @maltie  2 (tie)   @PA-Noob  2 (tie)   @LukeMcG  2 (tie)   @tgut03  2 (tie)       Week 2 Results: Congratulations to the Week 2 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 2: Community MembersSolutionsSuper UsersSolutionsPower Automate  @Deenuji  12@ManishSolanki 19 @Anil_g  10 @NathanAlvares24  17 @VishnuReddy1997  6 @Expiscornovus  10 @Tjan  5 @Nived_Nambiar  10 @eetuRobo  3 @SudeepGhatakNZ 8     Week 3 Results: Congratulations to the Week 3 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 3:Community MembersSolutionsSuper UsersSolutionsPower Automate Deenuji32ManishSolanki55VishnuReddy199724NathanAlvares2444Anil_g22SudeepGhatakNZ40eetuRobo18Nived_Nambiar28Tjan8David_MA22   Week 4 Results: Congratulations to the Week 4 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 4:Community MembersSolutionsSuper UsersSolutionsPower Automate Deenuji11FLMike31Sayan11ManishSolanki16VishnuReddy199710creativeopinion14Akshansh-Sharma3SudeepGhatakNZ7claudiovc2CFernandes5 misc2Nived_Nambiar5 Usernametwice232rzaneti5 eetuRobo2   Anil_g2   SharonS2  

Check Out | 2024 Release Wave 2 Plans for Microsoft Dynamics 365 and Microsoft Power Platform

On July 16, 2024, we published the 2024 release wave 2 plans for Microsoft Dynamics 365 and Microsoft Power Platform. These plans are a compilation of the new capabilities planned to be released between October 2024 to March 2025. This release introduces a wealth of new features designed to enhance customer understanding and improve overall user experience, showcasing our dedication to driving digital transformation for our customers and partners.    The upcoming wave is centered around utilizing advanced AI and Microsoft Copilot technologies to enhance user productivity and streamline operations across diverse business applications. These enhancements include intelligent automation, AI-powered insights, and immersive user experiences that are designed to break down barriers between data, insights, and individuals. Watch a summary of the release highlights.    Discover the latest features that empower organizations to operate more efficiently and adaptively. From AI-driven sales insights and customer service enhancements to predictive analytics in supply chain management and autonomous financial processes, the new capabilities enable businesses to proactively address challenges and capitalize on opportunities.    

Updates to Transitions in the Power Platform Communities

We're embarking on a journey to enhance your experience by transitioning to a new community platform. Our team has been diligently working to create a fresh community site, leveraging the very Dynamics 365 and Power Platform tools our community advocates for.  We started this journey with transitioning Copilot Studio forums and blogs in June. The move marks the beginning of a new chapter, and we're eager for you to be a part of it. The rest of the Power Platform product sites will be moving over this summer.   Stay tuned for more updates as we get closer to the launch. We can't wait to welcome you to our new community space, designed with you in mind. Let's connect, learn, and grow together.   Here's to new beginnings and endless possibilities!   If you have any questions, observations or concerns throughout this process please go to https://aka.ms/PPCommSupport.   To stay up to date on the latest details of this migration and other important Community updates subscribe to our News and Announcements forums: Copilot Studio, Power Apps, Power Automate, Power Pages

Users online (1,199)