Skip to content

fortytools/docsplit

This branch is 6 commits ahead of, 9 commits behind documentcloud/docsplit:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

e8e8941 · May 26, 2023
Dec 4, 2009
May 26, 2023
Nov 17, 2014
Nov 18, 2011
Jul 29, 2010
Jan 19, 2014
Dec 7, 2009
May 29, 2014
Nov 17, 2014
Feb 5, 2015
Nov 22, 2014

Repository files navigation

==
         __                      ___ __ 
    ____/ /___  ______________  / (_) /_
   / __  / __ \/ ___/ ___/ __ \/ / / __/
  / /_/ / /_/ / /__(__  ) /_/ / / / /_  
  \____/\____/\___/____/ .___/_/_/\__/  
                      /_/
                      
  Docsplit is a command-line utility and Ruby library for splitting apart
  documents into their component parts: searchable UTF-8 plain text, page 
  images or thumbnails in any format, PDFs, single pages, and document 
  metadata (title, author, number of pages...)
  
  Installation:
  gem install docsplit
  
  For documentation, usage, and examples, see:
  http://documentcloud.github.com/docsplit/
  
  To suggest a feature or report a bug: 
  http://github.com/documentcloud/docsplit/issues/

About

Break Apart Documents into Images, Text, Pages and PDFs

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages

  • Ruby 71.1%
  • HTML 28.9%