Below you will find pages that utilize the taxonomy term “Office”
Ubuntu - paperless office on a budget
Since paper and myself have never gotten on well I have always been dreaming of a paperless office. A while ago I purchased a Fujitsu ScanSnap S1500 scanner for the office. I did this after doing some research on which Automatic Document Feed (ADF) multipage & duplex scanners were both affordable as well as supported on Linux.
- scan the document
- perform OCR to convert to text
- combine the text with PDF to create a searchable PDF
- OPTIONAL – send the resulting document into Alfresco Document Management Server via FTP
Install dependencies
NOTE: PPA is only required for support of Fujitsu ScanSnap S1500
sudo apt-add-repository ppa:rolfbensch/sane-git<br></br>sudo apt-get update<br></br>sudo apt-get install sane sane-utils imagemagick tesseract-ocr pdftk libtiff-tools libsane-extras exactimage wput
Bulk converting Office documents to PDF
When you need to convert multiple documents to PDF for distribution (or from one Office format to another) there are a few utilities around. The most workable I found is the UNOCONV utility which is build on top of LibreOffice / OpenOffice. This uses the OpenOffice conversion facilities rather than a simple PDF print driver.
On Ubuntu it can be installed via Software Center or via apt-get from the core repositories.
sudo apt-get install unoconv
Combined with the -exec option of the Unix find command this makes conversion of whole directory structures a breeze.
#find all Word Documents and convert to PDF<br></br>find . -name "*.doc*" -exec unoconv -f pdf {} \;<br></br>#find all Powerpoint Documents and convert to PDF<br></br>find . -name "*.ppt*" -exec unoconv -f pdf {} \;
To show all the possible conversion formats you can use:
unoconv --show
The following list of document formats are currently available: