cancel
Showing results for 
Search instead for 
Did you mean: 
Reply

Move mouse to text on Screen ( OCR ), worked yesterday now dont work at all?

Hello

 

I'm stuck with this failure, any ideas are welcome. 

Screenshot_4.png

Yesterday this action worked perfectly, today, it does not work at all.  Tested on the same text that it worked on yesterday, on two different installations that are exact. Nothing has changed, I copied the flow and tested; to exclude any of the hidden issues that sometimes can break a flow. 

 

Error says it cannot find the text. =(

I have further tried to improve the action by setting X1 Y1 and X2 Y2 cords to help but to no avail. 

 

Ideas welcome

 

Regards

Daniel Olsson

 

1 ACCEPTED SOLUTION

Accepted Solutions

The solution :  

 

I had a call with Microsoft support engineer Sunil Barate and he explained a few useful things.  Below is an example of a flow and how you can set it up if it does not seem to work.

 

1. Add config on the start ORC engine action to 2  2.  This enables zooming in to help read the text.

 

2. The Get Window action seems to help the ORC to work better, even if we can see the window without the Get action in my flow it didn't work. It's a pretty good idea to focus the windows that the OCR should read, so I will always set that in the future.

 

3. A wait after starting ORC engine and focusing window, to help it all be in the correct state before moving the mouse by OCR start is as well recommended, the 5 sec is excessive but as it works you could test 4 3 2 etc.

 

4. To know the ORC can read the text, you can add an extract text with ORC and check that the text you look for is in the output variable.

 

DanielOlsson_0-1627311065185.png

 

Good luck!

View solution in original post

8 REPLIES 8
businesstaken
Frequent Visitor

ocr recogniziont working erratically, pretty much an hit or miss feature.

I've experienced same thing and gave up using it, I can't understand why same word in the same page doesn't get recognized anymore

Have you found an alternative? I will log a support ticket.  if the static text can be read and later not, the product has a flaw.  OCR is very important.

Hi @DanielOlsson 

 

I've worked with OCR in the past and noticed that, sometimes, changing the multiplier helps you achieve a better result:

tkuehara_2-1626989701976.png

With this in mind, I created a "retry" structure: if multiplier 1 doesn't bring any result, I'd then retry OCR with multiplier 2. Then, if failed again, I'd try multiplier 3. Also, when you reduce the screen area to search for a text, the failure rate is lower than trying to read the entire screen (and a lot faster as well). The disadvantage is that if you have a text appearing in different parts of the screen each time you run a flow then you'll need a very complex solution. At that time, I had to create another "retry structure level" with different sections of the screen: for example, if the search failed for the entire screen for all three multipliers, then I divided the screen by two (first vertically and then horizontally) and tried to look for the same text again in the first half and then the second half for all three multipliers again, if it has failed again then I'd further divide the screen in three or four parts (depends on the resolution) and look for the text again. This made the solution a lot more reliable (say it, from a failure rate of 1 in 10 to 1 in 40), sacrificing speed and increasing complexity though.

Also notice that if you are working with Citrix, the images presented on the screen might look the same but they can be sufficiently different for the OCR engine to not recognize it. This is because of an "adaptive compression" mechanism Citrix uses that sends a different resolution depending on the currently available GPU power, network bandwith and other parameters. Long story short: when using Citrix, expect it will fail from time to time because there will always be a "new image" it never sent before.

Thank you very much. I will set up and test this and let you know if it makes a difference.  ( I bet it will ) in addition, I will share the outcome of the support ticket. 

The solution :  

 

I had a call with Microsoft support engineer Sunil Barate and he explained a few useful things.  Below is an example of a flow and how you can set it up if it does not seem to work.

 

1. Add config on the start ORC engine action to 2  2.  This enables zooming in to help read the text.

 

2. The Get Window action seems to help the ORC to work better, even if we can see the window without the Get action in my flow it didn't work. It's a pretty good idea to focus the windows that the OCR should read, so I will always set that in the future.

 

3. A wait after starting ORC engine and focusing window, to help it all be in the correct state before moving the mouse by OCR start is as well recommended, the 5 sec is excessive but as it works you could test 4 3 2 etc.

 

4. To know the ORC can read the text, you can add an extract text with ORC and check that the text you look for is in the output variable.

 

DanielOlsson_0-1627311065185.png

 

Good luck!

Thanks for sharing!

phuft
Regular Visitor

I have had this same issue many times. I use the recognize text step to look for a specific word on the page and then click it. I have done all the steps recommended below and the step still randomly fails--magnification is at 2/2, I have the flow focusing on the desired window, I have a wait before the OCR step, but it still randomly breaks.

 

The only thing that temporarily fixes the issue is to paste a copy of the failing OCR step and delete the original step. I have no idea why this works, but so far it has worked every time. I can only assume it is some type of caching issue related to engineers trying to speed up inefficient code.

 

This will give me another random number of days of the OCR working, after which I will have to repeat to get it working again. I have sat through the entire flow multiple times and watched the text I am seeking show up clearly on the screen but not get recognized. 

 

I have had the exact same issue with clicking on UI Elements--they worked for a while but then stopped and the only way to repair them was to select the UI element again in the failing step--even though the UI element had not changed. 

 

I really hope Microsoft is able to fix this product-breaking issue, because without consistently working dynamic UI element/text recognition Power Automate is almost useless.

Bretzeli
Regular Visitor

Hello,

 

Nice others notice....

 

I am a Senior System Engineer IT and have been working for over 25 years with MS products, We are MS partner and reseller.

 

The OCR function of flow/Power Automate is absolute unuseable. If the OCR should work better in a zoom 2x2 why don't they intergrate in their code direct?

 

This all looks nice at the beginning but is just the worst terrible MS solution/Code we have ever seen in 20 years.

 

We do NOT recommend to automate anything business related in Enterprise with this solution.

You just move cost from regular backoffice to IT 2nd level if this keeps on working like this.

 

Worst is that it's run fully from their cloud. My PA on W11 PRO before W10 22H2 fully patches crashed again and there is no solution to recover the changes we made. Thought it may get better with W11.

 

 

Helpful resources

Announcements

Community will be READ ONLY July 16th, 5p PDT -July 22nd

Dear Community Members,   We'd like to let you know of an upcoming change to the community platform: starting July 16th, the platform will transition to a READ ONLY mode until July 22nd.   During this period, members will not be able to Kudo, Comment, or Reply to any posts.   On July 22nd, please be on the lookout for a message sent to the email address registered on your community profile. This email is crucial as it will contain your unique code and link to register for the new platform encompassing all of the communities.   What to Expect in the New Community: A more unified experience where all products, including Power Apps, Power Automate, Copilot Studio, and Power Pages, will be accessible from one community.Community Blogs that you can syndicate and link to for automatic updates. We appreciate your understanding and cooperation during this transition. Stay tuned for the exciting new features and a seamless community experience ahead!

Summer of Solutions | Week 4 Results | Winners will be posted on July 24th

We are excited to announce the Summer of Solutions Challenge!    This challenge is kicking off on Monday, June 17th and will run for (4) weeks.  The challenge is open to all Power Platform (Power Apps, Power Automate, Copilot Studio & Power Pages) community members. We invite you to participate in a quest to provide solutions to as many questions as you can. Answers can be provided in all the communities.    Entry Period: This Challenge will consist of four weekly Entry Periods as follows (each an “Entry Period”)   - 12:00 a.m. PT on June 17, 2024 – 11:59 p.m. PT on June 23, 2024 - 12:00 a.m. PT on June 24, 2024 – 11:59 p.m. PT on June 30, 2024 - 12:00 a.m. PT on July 1, 2024 – 11:59 p.m. PT on July 7, 2024 - 12:00 a.m. PT on July 8, 2024 – 11:59 p.m. PT on July 14, 2024   Entries will be eligible for the Entry Period in which they are received and will not carryover to subsequent weekly entry periods.  You must enter into each weekly Entry Period separately.   How to Enter: We invite you to participate in a quest to provide "Accepted Solutions" to as many questions as you can. Answers can be provided in all the communities. Users must provide a solution which can be an “Accepted Solution” in the Forums in all of the communities and there are no limits to the number of “Accepted Solutions” that a member can provide for entries in this challenge, but each entry must be substantially unique and different.    Winner Selection and Prizes: At the end of each week, we will list the top ten (10) Community users which will consist of: 5 Community Members & 5 Super Users and they will advance to the final drawing. We will post each week in the News & Announcements the top 10 Solution providers.  At the end of the challenge, we will add all of the top 10 weekly names and enter them into a random drawing.  Then we will randomly select ten (10) winners (5 Community Members & 5 Super Users) from among all eligible entrants received across all weekly Entry Periods to receive the prize listed below. If a winner declines, we will draw again at random for the next winner.  A user will only be able to win once overall. If they are drawn multiple times, another user will be drawn at random.  Individuals will be contacted before the announcement with the opportunity to claim or deny the prize.  Once all of the winners have been notified, we will post in the News & Announcements of each community with the list of winners.   Each winner will receive one (1) Pass to the Power Platform Conference in Las Vegas, Sep. 18-20, 2024 ($1800 value). NOTE: Prize is for conference attendance only and any other costs such as airfare, lodging, transportation, and food are the sole responsibility of the winner. Tickets are not transferable to any other party or to next year’s event.   ** PLEASE SEE THE ATTACHED RULES for this CHALLENGE**   Week 1 Results: Congratulations to the Week 1 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge.   Community MembersNumber SolutionsSuper UsersNumber Solutions Deenuji 9 @NathanAlvares24  17 @Anil_g  7 @ManishSolanki  13 @eetuRobo  5 @David_MA  10 @VishnuReddy1997  5 @SpongYe  9JhonatanOB19932 (tie) @Nived_Nambiar  8 @maltie  2 (tie)   @PA-Noob  2 (tie)   @LukeMcG  2 (tie)   @tgut03  2 (tie)       Week 2 Results: Congratulations to the Week 2 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 2: Community MembersSolutionsSuper UsersSolutionsPower Automate  @Deenuji  12@ManishSolanki 19 @Anil_g  10 @NathanAlvares24  17 @VishnuReddy1997  6 @Expiscornovus  10 @Tjan  5 @Nived_Nambiar  10 @eetuRobo  3 @SudeepGhatakNZ 8     Week 3 Results: Congratulations to the Week 3 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 3:Community MembersSolutionsSuper UsersSolutionsPower Automate Deenuji32ManishSolanki55VishnuReddy199724NathanAlvares2444Anil_g22SudeepGhatakNZ40eetuRobo18Nived_Nambiar28Tjan8David_MA22   Week 4 Results: Congratulations to the Week 4 qualifiers, you are being entered in the random drawing that will take place at the end of the challenge. Week 4:Community MembersSolutionsSuper UsersSolutionsPower Automate Deenuji11FLMike31Sayan11ManishSolanki16VishnuReddy199710creativeopinion14Akshansh-Sharma3SudeepGhatakNZ7claudiovc2CFernandes5 misc2Nived_Nambiar5 Usernametwice232rzaneti5 eetuRobo2   Anil_g2   SharonS2  

Check Out | 2024 Release Wave 2 Plans for Microsoft Dynamics 365 and Microsoft Power Platform

On July 16, 2024, we published the 2024 release wave 2 plans for Microsoft Dynamics 365 and Microsoft Power Platform. These plans are a compilation of the new capabilities planned to be released between October 2024 to March 2025. This release introduces a wealth of new features designed to enhance customer understanding and improve overall user experience, showcasing our dedication to driving digital transformation for our customers and partners.    The upcoming wave is centered around utilizing advanced AI and Microsoft Copilot technologies to enhance user productivity and streamline operations across diverse business applications. These enhancements include intelligent automation, AI-powered insights, and immersive user experiences that are designed to break down barriers between data, insights, and individuals. Watch a summary of the release highlights.    Discover the latest features that empower organizations to operate more efficiently and adaptively. From AI-driven sales insights and customer service enhancements to predictive analytics in supply chain management and autonomous financial processes, the new capabilities enable businesses to proactively address challenges and capitalize on opportunities.    

Updates to Transitions in the Power Platform Communities

We're embarking on a journey to enhance your experience by transitioning to a new community platform. Our team has been diligently working to create a fresh community site, leveraging the very Dynamics 365 and Power Platform tools our community advocates for.  We started this journey with transitioning Copilot Studio forums and blogs in June. The move marks the beginning of a new chapter, and we're eager for you to be a part of it. The rest of the Power Platform product sites will be moving over this summer.   Stay tuned for more updates as we get closer to the launch. We can't wait to welcome you to our new community space, designed with you in mind. Let's connect, learn, and grow together.   Here's to new beginnings and endless possibilities!   If you have any questions, observations or concerns throughout this process please go to https://aka.ms/PPCommSupport.   To stay up to date on the latest details of this migration and other important Community updates subscribe to our News and Announcements forums: Copilot Studio, Power Apps, Power Automate, Power Pages

Users online (1,717)