TextSnatcher: Copy text from images, for the Linux Desktop

bpfrh · 2024-03-15T07:50:32 1710489032

I use the same script as Dibby053, copied from Stack Overflow, but with some tweaks so that it works on KDE and GNOME, on Wayland as well as X11, and with some notifications about what state it is in.

I didn't test the x11/wayland check yet, but feel free to use it and report back.

  #!/bin/bash 
  # Dependencies: tesseract-ocr imagemagick 
  # on gnome: gnome-screenshot 
  # on kde: spectacle
  # on x11: xsel
  # on wayland: wl-clipboard

  die(){
    notify-send "$1"
    exit 1
  }
  cleanup(){
    [[ -n $1 ]] && rm -rf "$1"
  }

  SCR_IMG=$(mktemp) || die "failed to create temp file"

  # shellcheck disable=SC2064
  trap "cleanup '$SCR_IMG'" EXIT

  notify-send "Select the area of the text" 
  if  which "spectacle" &> /dev/null
  then
    spectacle -r -o "$SCR_IMG.png" || die "failed to take screenshot"
  else
    gnome-screenshot -a -f "$SCR_IMG.png" || die "failed to take screenshot"
  fi

  # convert to grayscale and upscale 4x, which should increase the detection rate
  mogrify -modulate 100,0 -resize 400% "$SCR_IMG.png" || die "failed to convert image"

  tesseract "$SCR_IMG.png" "$SCR_IMG" &> /dev/null || die "failed to extract text"
  if [ "$XDG_SESSION_TYPE" == "wayland" ]
  then
    wl-copy < "$SCR_IMG.txt" || die "failed to copy text to clipboard"
  else
    xsel -b -i < "$SCR_IMG.txt" || die "failed to copy text to clipboard"
  fi
  notify-send "Text extracted"
  exit

edit: Formatting
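
On the untested x11/wayland check: `XDG_SESSION_TYPE` is not guaranteed to be set in every environment the script might be started from, so falling back to `WAYLAND_DISPLAY` may be worth considering. A minimal sketch, assuming the same clipboard tools as the script above; `clipboard_cmd` is a hypothetical helper name, not part of the original script:

```shell
# Hypothetical helper: print the clipboard command that fits the current session.
# Assumes wl-copy (wl-clipboard) on Wayland and xsel on X11, as in the script above.
clipboard_cmd() {
  if [ "${XDG_SESSION_TYPE:-}" = "wayland" ] || [ -n "${WAYLAND_DISPLAY:-}" ]; then
    echo "wl-copy"
  else
    echo "xsel -b -i"
  fi
}

# Usage inside the script (commented out so the sketch has no side effects):
# $(clipboard_cmd) < "$SCR_IMG.txt" || die "failed to copy text to clipboard"
```

The `${VAR:-}` form keeps the check safe even if the script is later run with `set -o nounset`.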

guipsp · 2024-03-15T11:25:43 1710501943

I slightly modified your script to: 1. Clean up properly 2. Run spectacle in BG mode, so the window does not pop up after screenshotting.

  #!/bin/bash 
  # Dependencies: tesseract-ocr imagemagick 
  # on gnome: gnome-screenshot 
  # on kde: spectacle
  # on x11: xsel
  # on wayland: wl-clipboard
  
  die(){
    notify-send "$1"
    exit 1
  }
  cleanup(){
    [[ -n $1 ]] && rm -r "$1"
  }
  
  SCR_IMG=$(mktemp -d) || die "failed to create temp directory"
  
  # shellcheck disable=SC2064
  trap "cleanup '$SCR_IMG'" EXIT
  
  #notify-send "Select the area of the text" 
  if  which "spectacle" &> /dev/null
  then
    spectacle -b -r -o "$SCR_IMG/scr.png" || die "failed to take screenshot"
  else
    gnome-screenshot -a -f "$SCR_IMG/scr.png" || die "failed to take screenshot"
  fi
  
  # convert to grayscale and upscale 4x, which should increase the detection rate
  mogrify -modulate 100,0 -resize 400% "$SCR_IMG/scr.png" || die "failed to convert image"
  
  tesseract "$SCR_IMG/scr.png" "$SCR_IMG/scr" &> /dev/null || die "failed to extract text"
  if [ "$XDG_SESSION_TYPE" == "wayland" ]
  then 
    wl-copy < "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"
  else
    xsel -b -i  < "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"
  fi
  notify-send "Text extracted"
  exit

palmy · 2024-03-15T21:45:43 1710539143

This is great!

Also made some minor modifications: replaced `xsel` with `xclip` and added a truncated version of the copied text to the `notify-send` call:

  #!/bin/bash 
  # Dependencies: tesseract-ocr imagemagick 
  # on gnome: gnome-screenshot 
  # on kde: spectacle
  # on x11: xclip
  # on wayland: wl-clipboard

  die(){
    notify-send "$1"
    exit 1
  }
  cleanup(){
    [[ -n $1 ]] && rm -r "$1"
  }

  SCR_IMG=$(mktemp -d) || die "failed to create temp directory"

  # shellcheck disable=SC2064
  trap "cleanup '$SCR_IMG'" EXIT

  #notify-send "Select the area of the text" 
  if  which "spectacle" &> /dev/null
  then
    spectacle -n -b -r -o "$SCR_IMG/scr.png" || die "failed to take screenshot"
  else
    gnome-screenshot -a -f "$SCR_IMG/scr.png" || die "failed to take screenshot"
  fi

  # convert to grayscale and upscale 4x, which should increase the detection rate
  mogrify -modulate 100,0 -resize 400% "$SCR_IMG/scr.png" || die "failed to convert image"

  tesseract "$SCR_IMG/scr.png" "$SCR_IMG/scr" &> /dev/null || die "failed to extract text"
  if [ "$XDG_SESSION_TYPE" == "wayland" ]
  then 
    wl-copy < "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"
  else
    # xsel -b -i  < "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"
    xclip -selection clipboard -i < "$SCR_IMG/scr.txt" || die "failed to copy text to clipboard"  
  fi
  # Notify the user what was copied but truncate the text to 100 characters
  notify-send "Text extracted from image" "$(head -c 100 "$SCR_IMG/scr.txt")" || die "failed to send notification"
  exit

boneitis · 2024-03-16T02:05:45 1710554745

I just frankenstein'd a few people's versions into my own MATE-based flavor.

For anyone running into barriers, mate-screenshot has no outfile `-f` option, so I worked around that by outputting through the clipboard and capturing that with `xclip` (note, this is earlier in the script than the xsel/xclip line in the parent and gp comments):

  mate-screenshot -a -c && xclip -selection clipboard -t image/png -o > "$SCR_IMG/scr.png" || die "failed to take screenshot"

The other hiccup is that the dumped text file has two extraneous bytes '\x0a\x0c', so I truncated them with `head`:

  (xclip -selection clipboard -i < <(head -c -2 "$SCR_IMG/scr.txt")) || die "failed to copy text to clipboard"
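
`head -c -2` relies on GNU head and on the file ending in exactly those two bytes. A slightly more forgiving variant, sketched here with simulated OCR output rather than a real tesseract run, deletes the form feed explicitly and lets command substitution strip the trailing newline:

```shell
# Simulated tesseract output: the text, a newline (0x0a), then a form feed (0x0c).
sample=$(mktemp)
printf 'hello world\n\f' > "$sample"

# tr removes the form feed wherever it occurs; $(...) then drops trailing newlines.
cleaned=$(tr -d '\f' < "$sample")
printf '%s\n' "$cleaned"   # prints: hello world

rm -f "$sample"
```

In the script itself the same idea would read `tr -d '\f' < "$SCR_IMG/scr.txt" | xclip -selection clipboard -i`, which keeps the final newline but drops the form feed.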

Might not be pretty, but it looks like this will work for me. Thank you all for this!

bpfrh · 2024-03-15T15:47:53 1710517673

Good catch with spectacle, I thought I fixed that already.

Why did you remove the -f parameter?

rjzzleep · 2024-03-15T08:07:04 1710490024

I like all the error handling, but you could skip the temp files if you just pipe it through

    #!/usr/bin/env bash
    langs=(eng ara fas chi_sim chi_tra deu ell fin heb hun jpn kor nld rus tur)
    lang=$(printf '%s\n' "${langs[@]}" | fuzzel -d "$@")
    grim -g "$(slurp)" - | mogrify -modulate 100,0 -resize 400% png:- | tesseract -l eng+${lang} - - | wl-copy
    notify-send "Text extracted"

miduil · 2024-03-15T11:14:11 1710501251

If you just put `set -o errexit -o pipefail -o nounset` in the first line after the shebang your script will have proper error-handling as well. Currently, if any command in the pipeline fails, notify-send will still be triggered.
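
To see the difference without taking a screenshot, here is a small sketch that uses stand-ins: `false` plays the failing screenshot tool, `cat` the rest of the pipeline, and `echo` the final notification. None of these stand-ins come from the scripts above.

```shell
# Stand-in for "screenshot | ocr | clipboard; notify-send".
pipeline='false | cat; echo "Text extracted"'

# Default options: the pipeline's status is that of `cat`, so the notification still fires.
lenient=$(bash -c "$pipeline")

# Strict options: the shell stops at the failed pipeline and prints nothing.
strict=$(bash -c "set -o errexit -o pipefail -o nounset; $pipeline") || true

printf 'default: [%s]\nstrict:  [%s]\n' "$lenient" "$strict"
```

The first line of output shows the message, the second shows empty brackets, because the strict run exits before reaching `echo`.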

bpfrh · 2024-03-15T15:49:20 1710517760

This version looks nice and short, any thoughts on proper error reporting to the end user?

My version has more feedback for the user which was important because the user was somebody not familiar with linux/bash, but even my version "swallows" errors.

rjzzleep · 2024-03-15T16:53:19 1710521599

I added the `set -o errexit -o pipefail -o nounset` line suggested below, but I think mogrify only fails if the screenshot fails. Tesseract never fails if there is a valid input image, so realistically you only need one error message for the screenshot generation, unless you want to check whether the user is missing any of the tools.
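
Checking for missing tools up front can be done with `command -v`. A minimal sketch; `missing_tools` is a hypothetical helper name, and the tool list in the usage comment is simply the one from the pipeline version above:

```shell
# Hypothetical helper: print the name of every requested tool that is not on PATH.
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" > /dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Usage at the top of the script (commented out so the sketch has no side effects):
# missing=$(missing_tools grim slurp mogrify tesseract wl-copy notify-send)
# [ -z "$missing" ] || { echo "missing tools: $missing" >&2; exit 1; }
```

Reporting to stderr rather than through notify-send avoids depending on one of the tools being checked.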

tmerse · 2024-03-15T08:43:21 1710492201

I also used the very same script until I stumbled upon this on hn [0].

    #!/usr/bin/env bash
    langs=(eng ara fas chi_sim chi_tra deu ell fin heb hun jpn kor nld rus tur)
    lang=$(printf '%s\n' "${langs[@]}" | dmenu "$@")
    maim -us | tesseract --dpi 145 -l eng+${lang} - - | xsel -bi

[0]: https://news.ycombinator.com/item?id=33704483#33705272

tmerse · 2024-03-15T08:48:16 1710492496

Ah just saw rjzzleep posted an updated version here. Happy to steal this one again :)

begueradj · 2024-03-15T12:51:19 1710507079

Looks nice

Arch-TK · 2024-03-15T10:07:00 1710497220

    # shellcheck disable=SC2064
    trap "cleanup '$SCR_IMG'" EXIT

While shellcheck can have false positives, and SCR_IMG probably doesn't have any characters which need escaping, it's not exactly wrong in this case.

The command string passed to `trap` is evaluated as ordinary shell code when the trap fires, so variable expansions do take place at that point.

    trap 'cleanup "$SCR_IMG"' EXIT

Will behave correctly, and the expansion of SCR_IMG won't be susceptible to issues relating to unquoted shell characters.

Alternatively, if you're using a modern bash (this probably won't work on a mac by default), then this is an option too:

    trap "cleanup ${SCR_IMG@Q}" EXIT
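
The difference is easy to reproduce with a path that contains a single quote. The sketch below uses a throwaway directory and plain `rm -r` instead of the script's `cleanup` function; the directory name is made up for the demonstration:

```shell
# Throwaway directory whose name contains a single quote, to provoke the problem.
base=$(mktemp -d)
dir="$base/it's here"

# 1. Double-quoted trap: the path is pasted into the command string right away,
#    the embedded quote unbalances it, and the trap fails to parse when it fires.
mkdir "$dir"
( trap "rm -r '$dir'" EXIT ) 2> /dev/null || true
[ -d "$dir" ] && double="left behind" || double="removed"

# 2. Single-quoted trap: $dir is expanded only when the trap runs and stays one word.
mkdir -p "$dir"
( trap 'rm -r "$dir"' EXIT )
[ -d "$dir" ] && single="left behind" || single="removed"

echo "double-quoted: $double, single-quoted: $single"
rm -rf "$base"
```

Each trap is set inside a subshell so that it fires immediately when the subshell exits.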

bpfrh · 2024-03-15T15:52:55 1710517975

thanks for fixing and explaining that, I thought '' would work and forgot about escaping characters.

Gormo · 2024-03-18T13:42:54 1710769374

Binding a hotkey to `bash -c 'flameshot gui -s -r | tesseract - - | gxmessage -title "Decoded Data" -fn "Consolas 12" -wrap -geometry 640x480 -file -'` does the job for me.

I just press the hotkey (Super+O), drag the selection over whatever I want to OCR, then immediately get a popup dialog containing the captured text.

jonquark · 2024-03-15T10:13:01 1710497581

The Wayland leg works fine for me on gnome+wayland.

bpfrh · 2024-03-15T15:50:03 1710517803

thanks!

Dibby053 · 2024-03-15T03:34:25 1710473665

A while back I copied from somewhere this script that does the job nicely.

  #!/bin/bash
  # Dependencies: tesseract-ocr imagemagick scrot xsel

  IMG=`mktemp`
  trap "rm $IMG*" EXIT

  scrot -s $IMG.png -q 100
  # increase image quality with option -q from default 75 to 100

  mogrify -modulate 100,0 -resize 400% $IMG.png
  #should increase detection rate

  tesseract $IMG.png $IMG &> /dev/null
  cat $IMG.txt | xsel -bi
  notify-send "Text copied" "$(cat $IMG.txt)"

  exit

grimgrin · 2024-03-15T05:30:24 1710480624

In the spirit of sharing, cuz I think this is a great script (thank you), I prefer using maim over scrot simply because it has a --nodrag option. Personally feels better when making selections from a trackpad. Click once, move cursor, click again.

    maim -s --nodrag --quality=10 $IMG.png

10 is scrot's 100

raphman · 2024-03-15T08:17:37 1710490657

Yet another variation I have been using for ages, using ImageMagick's `import` tool (which probably only works on X11)

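    # temp file setup added as an assumption; the original snippet started at `import`
    tempfile=$(mktemp --suffix=.png)
    import "$tempfile"
    TEXT=$(tesseract -l eng+deu "$tempfile" stdout)
    printf '%s\n' "$TEXT" | xsel -i -b
    rm -f "$tempfile"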

dsp_person · 2024-03-15T03:50:05 1710474605

I was using something like this for awhile, but I found tesseract did poorly quite often. That resize trick didn't seem to affect much. I'm not sure what pre-processing would make it better.

I'd love to know if TextSnatcher does anything to improve on this. The github page is opaque.

mappu · 2024-03-15T04:38:20 1710477500

The source is pretty straightforward - it's calling `scrot -s -o` to a temp file, and then `tesseract` with no further preprocessing.

https://github.com/RajSolai/TextSnatcher/blob/master/src/ser...

stevesimmons · 2024-03-15T14:59:32 1710514772

> I found tesseract did poorly quite often

The script calls Tesseract in default page segmentation mode (PSM 3). [1]

Depending on the input text, PSM mode 11 for disconnected text would probably work much better. That uses the flag "--psm 11".
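
For the scripts in this thread, trying that mode is a one-flag change. A minimal sketch; `ocr_sparse` is a hypothetical wrapper name and the `eng` default is an assumption, not something taken from TextSnatcher:

```shell
# Hypothetical wrapper: OCR an image in sparse-text mode and print the text to stdout.
# --psm 11 is "sparse text"; the default, 3, is fully automatic page segmentation.
ocr_sparse() {
  tesseract "$1" stdout --psm 11 -l "${2:-eng}" 2> /dev/null
}

# Usage: ocr_sparse screenshot.png eng+deu
```

Whether this helps depends on the input: UI screenshots with scattered labels are the case it is meant for, while a paragraph of running text may do better in the default mode.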

[1] From the original repo: string tess_command = "tesseract " + file_path + " " + out_path + @" -l $lang" ;

aidenn0 · 2024-03-15T20:03:47 1710533027

Having used Tesseract for OCR for other things, getting the right PSM helps but it's still rather terrible, especially for sans-serif fonts, which are common in UIs.

Granted there's a lot of ambiguity in sans serif fonts, lower-case "L", vertical bar, and upper-case "i" can even be pixel-identical, but I've seen tesseract turn

  Chapter III

into

  Chapter |l1

which really surprises me. In fact, for books, I run it through sed to replace vertical bar with upper-case "i" and it significantly improved recognition.
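
The sed step described above might look roughly like this; it is a guess at the substitution, not the exact command used, and `fix_bars` is a hypothetical name:

```shell
# Replace every vertical bar in the OCR output with an upper-case "I".
fix_bars() {
  sed 's/|/I/g'
}

printf 'Chapter |l1\n' | fix_bars   # prints: Chapter Il1
```

Note that this only repairs the bar; the lower-case "l" and the digit "1" in the example remain wrong, and any legitimate vertical bars in the text would be replaced too.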

hiAndrewQuinn · 2024-03-15T07:15:23 1710486923

I had a PowerShell script which did this as well, but alas, it was lost to time with the rest of my little scripts from my last job.

Apologies to all of my fellow Unix-Windows borderers.

Arch-TK · 2024-03-15T10:15:33 1710497733

  trap "rm $IMG*" EXIT

see https://www.shellcheck.net/wiki/SC2064

also, use mktemp -d and recursively delete the directory

doix · 2024-03-15T04:56:56 1710478616

This is perfect for me! Having a window with a button that I need to click is much worse than just binding a script to a hotkey.

cfiggers · 2024-03-15T03:37:37 1710473857

For my fellow Windows-using plebeians, the official Microsoft PowerToys add-in [0] has a feature that does this (it's also been added to the stock screenshot tool, but I personally find the one keyboard shortcut in PowerToys more pleasant to use).

[0] https://github.com/microsoft/PowerToys

fredzel · 2024-03-15T04:50:02 1710478202

Snipping Tool's built-in OCR works for multiple languages (English, Russian, Chinese, Japanese etc.) without the need to install any language OCR packs, though.

lysp · 2024-03-15T05:45:36 1710481536

Inbuilt snip tool does that too.

WIN+SHIFT+S

If it doesn't have the "Text actions" icon (dashed square with paragraph lines in it), you can update it via windows store to get the latest version.

krick · 2024-03-16T13:32:56 1710595976

It's bugging me for a long time now. Is tesseract actually the state-of-the-art solution here?

I just really don't know, it feels like it's, uh, subpar. Isn't it? I never seriously worked in that domain, but it somehow felt to me in 2019 that, with all the recent advancements in computer vision, text recognition must be essentially a solved problem. I'd expect it to be better than human. Yet I still cannot accurately convert a low-res scan (scan! not even a photo!) of a receipt with tesseract, especially if it isn't in English. Maybe I just cannot properly use it?

Gormo · 2024-03-18T13:43:51 1710769431

I use Tesseract semi-regularly and only rarely have recognition issues, including with receipt scans (or even photos). How are you specifically using it?

usr1106 · 2024-03-15T09:06:44 1710493604

I see tesseract mentioned more and more.

Myself I tried it probably 10-15 years ago on scanned scientific papers (decent scanning quality). The results were disappointing. The manual postprocessing required was not much less than typing it directly. So tesseract became a synonym of "not worth trying" to me.

Maybe things have improved over the years, so I should give it a new try. (No particular use case at the moment, but those tend to appear occasionally.)

graynk · 2024-03-15T12:56:36 1710507396

It’s good now _if_ you OCR only scanned documents or otherwise have a lot of control over how you prepare the images before it’s OCR’ed. For more general purpose recognition with weird fonts and bad image quality EasyOCR gave me much better results

sp332 · 2024-03-15T16:38:05 1710520685

This project is including Tesseract 4.1.1 which is at least a couple years old.

mellutussa · 2024-03-15T09:33:06 1710495186

Try https://github.com/ocrmypdf/OCRmyPDF - it uses Tesseract behind the scenes and it is absolutely brilliant.

mkl · 2024-03-15T10:05:44 1710497144

It's way better now. I used it 15 years ago and had to do quite a bit of preprocessing to get not-entirely-terrible results, but now I use it with great success and no preprocessing.

walteweiss · 2024-03-15T09:34:34 1710495274

First time I used it 3 to 4 years ago, it was good.

jchw · 2024-03-15T10:28:10 1710498490

I gave it a try. Works pretty good.

Being a Flatpak app, it will require desktop portals to fully work. That said, it worked absolutely fine out of the box for me with my existing xdg-desktop-portal-wlr setup. So, it should work fine in any X11 or Wayland setup where you have an xdg-desktop-portal setup that supports the Screenshot API.

The results are mixed, but not bad by any means. Cleanly readable text comes out mostly fine with maybe only whitespace issues and the occasional error, which makes this still potentially very useful for copying text out of error dialogs and whatnot. (Though, I've found that on Linux, error dialogs are far more likely to have selectable text in the first place. And on Windows, standard MessageBox responds to Ctrl+C.)

rvdca · 2024-03-15T07:37:27 1710488247

The similar app I am using, with great success, is Frog (https://getfrog.app).

mathfailure · 2024-03-15T11:11:36 1710501096

No AppImage, no .deb, not even brew.

ssernikk · 2024-03-15T20:28:38 1710534518

It's on nixpkgs under name `gnome-frog` (for nix users)

schappim · 2024-03-15T03:41:38 1710474098

There is a utility available for macOS that extends beyond simply opening a document in Preview and attempting to select the text: https://github.com/schappim/macOCR

I like the author.

lelandfe · 2024-03-15T03:46:54 1710474414

FWIW one can skip Preview and just do Cmd-Shift-3, click the thumbnail, and interact with the text in the quicklook. Then, delete the image (trashcan in top right). Cmd-A works, too. Here's me using it on that comment: https://imgur.com/a/q0NvcS6

helsinkiandrew · 2024-03-15T08:09:46 1710490186

Thank You!

xiwenc · 2024-03-15T05:37:37 1710481057

I got a similar solution on iOS as a shortcut connected to the action button. Some apps don't allow easy text copying, and it also helps when the text is in a foreign language. It does:

- take screenshot

- extract text from it

- translate the text to english. Auto detects source language

- show both original and translated text in quick view where you can select and copy if desired.

Here is an implementation you can try:

https://www.icloud.com/shortcuts/f420d24e4960415da1a43f230ab...

While on the subject of iOS. In recent versions when you open a photo in the photos app you can also select the text in the photo by hand and copy it.

sunnybeetroot · 2024-03-15T06:00:10 1710482410

This is fantastic thanks for sharing. I have used it in the share sheet when tapping share on an image and it works but given I am already providing the image, the screenshot is redundant.

vmoore · 2024-03-15T04:01:38 1710475298

> for the Linux Desktop

Caveat: This is a Flatpak and not all Linux distros ship with Flatpak. But I'll give it a whirl in my Fedora virtual machine. I've seen many flavors of this type of tool floating about, most of them leveraging Tesseract[0], and I've tried a few of them. It fails badly on grainy / noisy images or where the text is warped or skewed in some way. It will not solve CAPTCHAs for you!

[0] https://tesseract-ocr.github.io/tessdoc/Home.html

Retr0id · 2024-03-15T06:09:26 1710482966

Which distros does flatpak not work on?

SushiHippie · 2024-03-15T13:53:13 1710510793

https://flatpak.org/setup/

Flatpak should work on every distro. However, it may not be included by default, so you need to install flatpak before installing this application.

adhamsalama · 2024-03-15T08:36:08 1710491768

Hannah Montana.

ChocolateGod · 2024-03-15T08:49:07 1710492547

I'm sure it would work if you built it from source.

akho · 2024-03-15T12:04:36 1710504276

...it's a pile of Vala code. What you probably mean is that the author did not make a package for your distribution, and there is no one else who had the time and inspiration to package it. You can be the maintainer you seek.

ChocolateGod · 2024-03-15T08:49:52 1710492592

Not really a caveat. If it only had a .deb you could argue it doesn't work on non-Ubuntu/Debian distros, which would be a far bigger caveat.

rounakdatta · 2024-03-15T06:16:27 1710483387

Interesting, I've always resorted to using Google Lens via the phone for this purpose. And then using the "Copy to another device" feature of Chrome.

askvictor · 2024-03-15T09:25:43 1710494743

Would be great if flameshot had this feature. It's otherwise the best screenshot tool I've ever come across

gpuhacker · 2024-03-15T07:27:10 1710487630

Surprises me to see I'm the first comment here to say: I just use GPT4 for this. Works perfectly, even for getting the Latex source of a formula you only have a screenshot of.

Probably quite the overkill in terms of energy efficiency for just image to text, but I only need this like once every two weeks or so.

askl · 2024-03-15T12:03:40 1710504220

I'm using normcap[1] for this. The workflow feels a bit more polished (Though also not perfect) and the repo is still active.

[1] https://github.com/dynobo/normcap

dobicinaitis · 2024-03-15T14:17:20 1710512240

Another variant of the scripts floating around that I've been using to scratch the same itch:

  #!/bin/bash
  # Performs Optical Character Recognition (OCR) on a freely chosen
  # screen area and copies the recognized text to the clipboard.
  #
  # Dependencies: sudo apt install gnome-screenshot tesseract-ocr xclip
  
  IMAGE_FILE="/tmp/ocr.png"
  
  gnome-screenshot --area --file "$IMAGE_FILE"
  tesseract "$IMAGE_FILE" - | xclip -rmlastnl -selection clipboard
  
  rm -f "$IMAGE_FILE"

littlestymaar · 2024-03-15T09:12:13 1710493933

Why is this posted now? The repo has seen no activity in the past two years, and the https certificate on the website has also been expired since 2022, so I'm not sure it is still alive…

Pokerface777 · 2024-03-15T11:21:48 1710501708

there's probably a lot of software that wasn't updated in the last 10 years that could still be really useful

littlestymaar · 2024-03-15T11:48:41 1710503321

Sure, but at the same time if it's not included on distributions and not updated by upstream it's likely to have compatibility issues relatively quickly (GTK is particularly bad at maintaining compatibility between versions, even point releases).

Also, code not being updated is one thing; a TLS certificate not being renewed is in another league in terms of lack of support for the project.

Pokerface777 · 2024-03-15T14:27:35 1710512855

fair points... but I feel like those are problems that need to be solved. Emulators and VMs do fix some of those.

NegativeLatency · 2024-03-15T21:14:59 1710537299

Been using this little script for mac os to copy text out of images without going through Preview.app: https://github.com/nburns/utilities/blob/main/ocr

(would definitely appreciate feedback/critiques from any swifties out there)

Zambyte · 2024-03-15T05:04:44 1710479084

I wrote a script a while back gluing wofi (for dispatching several screenshot related tasks), grim, and tesseract together.

https://robbyzambito.me/posts/tips-and-tricks-for-taking-scr...

yjftsjthsd-h · 2024-03-15T05:39:16 1710481156

Suggestion: You could just

    mkdir -p ~/Pictures/Screenshots

and not have to warn the user to create it.

rhettbull · 2024-03-15T20:20:40 1710534040

For macOS users, I'm the author of Textinator [0] a similar utility for macOS that uses Apple's Vision framework [1] for doing the OCR natively. Modern versions of macOS (since Sonoma) have a similar ability to copy text from images using the Live Text feature [2] but Textinator works on macOS Catalina and later and simplifies the "take screenshot, copy text to clipboard" workflow. It's also an example of how to build a native macOS app entirely in Python.

[0]: https://github.com/RhetTbull/textinator

[1]: https://developer.apple.com/documentation/vision?language=ob...

[2]: https://support.apple.com/guide/preview/interact-with-text-i...

imhoguy · 2024-03-15T10:46:28 1710499588

Is there anything that could handle indentation? I use very similar tool on Linux already (also available on Windows and Mac): https://dynobo.github.io/normcap/

stevenicr · 2024-03-17T17:57:33 1710698253

was hoping to find a replacement for https://translate-image.com/

which promises to grab text from an image and then translate and make new image with same style..

But I've tried it many times with different images this week and keep getting same error.

Could be that Canva has this and I'm not aware of it yet.

I've needed this this week, but may just open up affinity designer and make one manually at this point.

lacoolj · 2024-03-15T16:17:31 1710519451

that's pretty cool but definitely has a ways to go (the example on the github page even shows a few discrepancies between original and pasted text - seems to be mostly punctuation though)

very nice though thanks for sharing!

sp332 · 2024-03-15T16:33:23 1710520403

Looks like it hasn't been updated in a couple years.

osbkca · 2024-03-15T06:44:20 1710485060

A great tool, but it only works on Linux. I found the Xclippy tool (https://xclippy.com/), which is available on Windows and macOS.

crooked-v · 2024-03-15T06:56:11 1710485771

Note that this is already a (non-obvious) built-in feature on Mac and iPhone, called "Live Text". See these articles for examples:

https://support.apple.com/guide/preview/interact-with-text-i...

https://support.apple.com/guide/photos/interact-with-text-in...

https://support.apple.com/en-us/HT212630

LudwigNagasena · 2024-03-15T07:57:06 1710489426

On iPhone you can even search your images by text content.

dotancohen · 2024-03-15T08:24:07 1710491047

For completeness sake, Samsung phones with the S-Pen can also OCR. That would be, the old Note series and now the S-Ultra phones.

Pokerface777 · 2024-03-15T15:12:04 1710515524

very bad UI

dark-star · 2024-03-15T16:17:48 1710519468

It seems like this tool sends your screenshot to some sort of web service.

If that's really the case then obviously don't use it for personal data (invoices, love letters, legal proceedings, ...)

jchw · 2024-03-15T17:21:49 1710523309

I didn't see anything of the sort looking through the source code. I see it uses portals (or scrot) to take a screenshot, and spawns Tesseract as an external process.

https://github.com/RajSolai/TextSnatcher/blob/9e67760d6c16ea...

Tesseract itself seems to be included in the Flatpak as you'd expect:

https://github.com/RajSolai/TextSnatcher/blob/master/manifes...

Where did you get that?

sp332 · 2024-03-15T16:29:50 1710520190

Why would it be using Tesseract if it also uses an external service? And who's paying for the service?

Dwedit · 2024-03-15T09:21:12 1710494472

How is OCR these days? Lately I'm seeing more deeplearning-based OCR, and it gives you significantly different results just by cropping the image differently.

jbverschoor · 2024-03-15T09:49:26 1710496166

Preview on macOS does it automatically. No tools needed.

carlesfe · 2024-03-15T10:50:21 1710499821

Preview is one of the most underrated apps in macOS, and the one I miss the most when I use Linux or Windows. It's a great little toolbox for quick editing and convenience features.

jjice · 2024-03-15T13:10:08 1710508208

I didn't realize it was underrated, but it's probably the best MacOS bundled software. If I could get a Linux equivalent, that would be fantastic. Viewing, editing, PDF filling, PDF signing (so useful), all in a fast and responsive tool is just incredible.

If anyone has anything near a Linux equivalent, please let me know.

jbverschoor · 2024-03-18T13:11:28 1710767488

Except for the color and font picker. Omg so slow

gerardnico · 2024-03-15T08:01:13 1710489673

The Google Photos mobile app on iOS has OCR built in.

I take a photo, grab the text and send it via WhatsApp web app.

Not as easy as copying straight to the clipboard, but I haven't found anything like that on Windows.

talhah · 2024-03-15T08:33:00 1710491580

If you're on a Pixel 7 and upwards or the latest Samsung phones there's also circle to search by holding the home button down. The OCR works quite well including English, Russian, Arabic, Japanese and I'm sure it works on other languages too.

If you're on Android 14 you can also copy text through the recents/overview menu simply by highlighting the text. And finally there's Google Photos if you don't have any of these features.

There's also Google lens if you're trying to copy text that isn't on your screen.

passion__desire · 2024-03-15T09:17:21 1710494241

You can take a screenshot. Open that image file with Chrome and then do "Search Images with Google" . There you can grab the text.

zuhsetaqi · 2024-03-15T08:24:51 1710491091

On iOS it's system wide. It's an iOS and also a macOS feature

noselasd · 2024-03-15T09:02:50 1710493370

Another neat thing is when you copy the text on your iOS device, it appears in your clipboard on Mac, so you can just paste it. (Assuming both devices are on the same wifi/local network.)

walteweiss · 2024-03-15T09:43:06 1710495786

And you need to select the check box somewhere in the settings app for it to work.

Source: helped a friend with the feature recently, he didn’t know it exists.

ddtaylor · 2024-03-15T09:17:26 1710494246

It's a shame this is for elementaryOS as those apps typically do not work correctly on other basic Gnome systems.

makach · 2024-03-15T20:03:27 1710533007

Excellent, just like text extractor in windows powertoys! Love this!

mergy · 2024-03-15T15:23:03 1710516183

Compiled on Deb 12.5.x - pretty cool. Thank you.

avipars · 2024-03-15T20:11:34 1710533494

Does this work in languages besides english?

igtztorrero · 2024-03-15T11:59:53 1710503993

Vala , first time I hear about it !!!

nathansherburn · 2024-03-15T13:29:33 1710509373

Just three hours ago I switched back to Linux after a few years on MacOS. The only thing missing was the amazing text copy tool I was using, "TRex" [1]. What a coincidence to see this post on the front page a few hours later!

Side note, what a breath of fresh air Gnome on Fedora is!

[1] https://github.com/amebalabs/TRex

freedomben · 2024-03-15T15:56:34 1710518194

It's obviously personal opinion, but I think you made the best choice! (Gnome on Fedora). Welcome back!

It's remarkable how much more polished Gnome is from a few years ago. If you use 2FA TOTP, make sure to install Gnome Authenticator if you haven't already. If you use Aegis on Android (or a handful of other formats) it can import/export your seeds. It is downright luxurious having this on my laptop/desktop:

    # If you haven't setup flathub yet
    sudo flatpak remote-add --if-not-exists flathub https://flathub.org/repo/flathub.flatpakrepo
    
    # Install Authenticator from flathub.  Source:  https://gitlab.gnome.org/World/Authenticator
    sudo flatpak install flathub com.belmoussaoui.Authenticator

BHSPitMonkey · 2024-03-15T19:24:48 1710530688

You can also add TOTP secrets to entries in KeePassXC and generate/copy codes there (Ctrl+T).

fnord77 · 2024-03-15T14:54:55 1710514495

On a side note, sonoma and ios have this functionality built in now.

nico · 2024-03-15T15:38:09 1710517089

I'm running Monterey and have the feature, but it's only inside Preview, which means I need to either open the image in Preview or take a screenshot and then open a new image in Preview to paste the screenshot, before getting the text

rhettbull · 2024-03-15T22:31:20 1710541880

Check out Textinator [0] which is an open source macOS app that watches for screenshots and automatically does text detection then copies text to clipboard. (Disclaimer: I'm the author). It works on macOS 10.15+

[0]: https://github.com/RhetTbull/textinator

throwaway290 · 2024-03-15T16:44:13 1710521053

should work in quicklook and safari at least since Ventura. Does for me