Programming Tutorials Browser Tutorials Articles Struts Tutorials Hibernate Tutorials

  Tutorial: PDFTextStream v2.0 Released - Adds CJK Support, New Unstructured Content Extraction Tools

We are very proud to introduce you to PDFTextStream v2.0

Tutorial Details:

One of the most frequently asked-for "features" was that PDFTextStream should be available for platforms other than Java. And, while PDFTextStream's origins are rooted firmly in the Java world, we have found ways to bring PDFTextStream to two more very popular, very powerful platforms, .NET and Python.

PDFTextStream.NET and PDFTextStream.Python don't ask you to compromise: you get all of the performance, functionality, robustness, and features provided by PDFTextStream for Java, but on the development platform of your choice. Rarely is life this kind.

Today's is a global marketplace, and surviving means being able to work well with others, regardless of their location or language. So, it is fitting that PDFTextStream v2.0 now supports extracting Chinese, Japanese, and Korean (CJK) text from PDF documents (as well as other less well known double-byte character sets).

PDFTextStream v2.0's CJK text extraction capabilities aren't half baked or bolted-on; they've been built into the library at the lowest levels. This enables PDFTextStream v2.0 to:

* properly extract CJK text written horizontally as well as vertically
* properly segment and order chunks of CJK text on each page of a PDF (rather than blindly outputting the CJK text in the order it is encoded in the PDF file, as is done by some PDF extraction libraries)
* do all of this with no performance penalty or additional cost to you


 

Read Tutorial at: Click here to view the tutorial

Rate Tutorial:
PDFTextStream v2.0 Released - Adds CJK Support, New Unstructured Content Extraction Tools

View Tutorial:
PDFTextStream v2.0 Released - Adds CJK Support, New Unstructured Content Extraction Tools

Related Tutorials:

The Java HotSpot Performance Engine is set to break new records - JavaWorld
The Java HotSpot Performance Engine is set to break new records - JavaWorld
 
What's new in Java Servlet API 2.2? - JavaWorld October 1999
What's new in Java Servlet API 2.2? - JavaWorld October 1999
 
XML for the absolute beginner - JavaWorld - April 1999
XML for the absolute beginner - JavaWorld - April 1999
 
A ZipClassLoader for automated application distribution - JavaWorld April 2000
A ZipClassLoader for automated application distribution - JavaWorld April 2000
 
The Lucene search engine: Powerful, flexible, and free - JavaWorld September 2000
The Lucene search engine: Powerful, flexible, and free - JavaWorld September 2000
 
JSP templates - JavaWorld September 2000
JSP templates - JavaWorld September 2000
 
Solve your servlet-based presentation problems - JavaWorld November 2000
Solve your servlet-based presentation problems - JavaWorld November 2000
 
Check out three collections libraries
Check out three collections libraries
 
Enhance your J2EE presentation layer
Enhance your J2EE presentation layer
 
Real World HTML Parser
Real World HTML Parser The two fundamental use-cases that are handled by the parser are extraction and transformation (the syntheses use-case, where HTML pages are created from scratch, is better handled by other tools closer to the source of data). Whil
 
Sphinx-4 - A speech recognizer
Sphinx-4 - A speech recognizer General Information about Sphinx-4 Introduction Sphinx-4 is a state-of-the-art speech recognition system written entirely in the JavaTM programming language. It was created via a joint collaboration between the Sphinx
 
YourKit Java Profiler 2.5.2 Released
YourKit Java Profiler 3.2 Released With help of YourKit Java Profiler, an outstanding tool for Java professionals, you can easily solve wide range of CPU and memory related performance problems in J2EE and J2SE applications.
 
PDFTextStream v1.1.2 Released; Fast Java PDF Text Extraction
PDFTextStream is the ideal solution for Java applications and J2EE web services that need to rapidly and accurately extract text and document metadata from PDF files.
 
YourKit Java Profiler 3.2 Released
With help of YourKit Java Profiler, an outstanding tool for Java professionals, you can easily solve wide range of CPU and memory related performance problems in J2EE and J2SE applications.
 
SOFIA 2.3 released with Hibernate and Dynamic HTML components
SOFIA 2.3 released with Hibernate and Dynamic HTML components
 
Sony Ericsson Developer World Program
Learn at-a-glance about the Sony Ericsson Developer World global support program and web portal totally dedicated to helping serious wireless developers achieve business success by providing the documentation, tools, training, technical and support.
 
Sony Ericsson Developer World Program
Learn at-a-glance about the Sony Ericsson Developer World global support program and web portal totally dedicated to helping serious wireless developers achieve business success by providing the documentation, tools, training, technical and go-to-market s
 
TheOpenCD is a collection of high quality Free and Open Source Software
TheOpenCD is a collection of high quality Free and Open Source Software TheOpenCD v2.0 Now Available TheOpenCD v2.0 TheOpenCD is a collection of high quality Free and Open Source Software. The programs run in Windows and cover the most common
 
NetBeans IDE 4.1
Out-of-the-box support for J2EE 1.4 and Web Services. Check out what early access release 2 can do for you!
 
Chat Transcript: Java Web Services Developer Pack (Java WSDP) 1.5
Learn about the exciting new web services features in the recently-released Java WSDP 1.5.
 
Site navigation
 

 

Send your comments, Suggestions or Queries regarding this site at roseindia_net@yahoo.com.

Copyright © 2006. All rights reserved.