Page 1 of 1

set marker for searchable/non searchable PDF files

Posted: 03.03.2021, 11:26
by mgroen
This is a feature request:
make FreeCommander able to set a marker so that the user is able to distinguish searchable / non searchable PDF files from each other?
For example, by creating a column "searchable" and put an S (or any marker) on each line for searchable PDF files?
(and not set a marker for non searchables)

Explanation:
There are 2 types of PDF files:
1. PDFs which are searable (either by creation of virtual printer), or by OCR (Optical Character Recognition) application on a image based PDF file
2. PDFS which are NON searchable (basically only contain an image). These are not searchable.

I have lots of PDF files, some of them are searchable, some not.
I want to get an overview of searchables/non searcables, not by opening each file manually and check if its searchable.

The main goal behind this question is that I have lots of pdf files and I want to make them all searchable but to do that I first need to have an overview of which pdf files are already searchable and which not.

I just noticed another Totalcommander (another filemanager) has a plugin that has this functionality. More info here: https://totalcmd.net/plugring/pdfOCR.html
Although am I an not really a fan of Totalcommander I was hoping this functionality can be implemented in FreeCommander.

Thanks,
Mathijs

Re: set marker for searchable/non searchable PDF files

Posted: 06.03.2021, 14:58
by mgroen
mgroen wrote: 03.03.2021, 11:26 This is a feature request:
make FreeCommander able to set a marker so that the user is able to distinguish searchable / non searchable PDF files from each other?
For example, by creating a column "searchable" and put an S (or any marker) on each line for searchable PDF files?
(and not set a marker for non searchables)

Explanation:
There are 2 types of PDF files:
1. PDFs which are searable (either by creation of virtual printer), or by OCR (Optical Character Recognition) application on a image based PDF file
2. PDFS which are NON searchable (basically only contain an image). These are not searchable.

I have lots of PDF files, some of them are searchable, some not.
I want to get an overview of searchables/non searcables, not by opening each file manually and check if its searchable.

The main goal behind this question is that I have lots of pdf files and I want to make them all searchable but to do that I first need to have an overview of which pdf files are already searchable and which not.

I just noticed another Totalcommander (another filemanager) has a plugin that has this functionality. More info here: https://totalcmd.net/plugring/pdfOCR.html
Although am I an not really a fan of Totalcommander I was hoping this functionality can be implemented in FreeCommander.

Thanks,
Mathijs
To make it more clear what I need, I made a screenshot.

In short again: I need an overview of files with filenames and a mark/display if PDF file is searchable or not.

here is what I need:

Image
png plaatjes

Re: set marker for searchable/non searchable PDF files

Posted: 02.01.2023, 19:23
by mgroen
Any update on this? Anyone?

Note:
basically, in above thread, "searchable" means the PDF has been (partially) OCRed (processed with OCR software)

Re: set marker for searchable/non searchable PDF files

Posted: 02.01.2023, 20:10
by horst.epp
I use one of the xpdf-tools in a batch which poduces a list of PDFs which need an OCR.

Code: Select all

@echo off
setlocal
rem echo on
pushd "%~dp0"
cls
::____________________________________________________________
::
::				SETTINGS
::____________________________________________________________
::
	chcp 1252
	set OUT-List=.\need_ocr.txt
	del %OUT-LIST%

::____________________________________________________________
::
::				ACTION!
::____________________________________________________________
::

	for %%X in (*.pdf) do (
		echo.    [%%X]
		C:\Tools\xpdf-tools\pdftotext.exe -simple "%%X" .\checkthis.txt
		for %%C in (checkthis.txt) DO if %%~zC LSS 25 ( echo %~dp0%%X>>"%OUT-List%" )
		del checkthis.txt
	)
pause
goto :EOF


Re: set marker for searchable/non searchable PDF files

Posted: 03.01.2023, 10:29
by mgroen
@horst.epp how to implement this?

Re: set marker for searchable/non searchable PDF files

Posted: 03.01.2023, 14:04
by horst.epp
mgroen wrote: 03.01.2023, 10:29 @horst.epp how to implement this?
In short:
Download the free XPDF command line tools from
https://www.xpdfreader.com/download.html
Use my batch as a template and change according to your environment.
Create a FC favorite button to invokde the batch with the current dir as paremeter.