mupdf - MuPDF PDF reader and library

Age	Commit message (Collapse)	Author
2015-07-31	win32: Convert argv to utf-8 and use regular getopt.	Tor Andersson
	Easier than duplicating getopt for wchar_t, since we already have windows specific functions to convert wchar_t strings.
2015-07-29	Add support for parsing GIF images.	Sebastian Rasmussen

2015-07-29	Support reading LZW codes in reverse order from each byte.	Sebastian Rasmussen

2015-07-29	Support LZW codes shorter than 9 bits.	Sebastian Rasmussen

2015-07-27	Limit dash phase to length of dash pattern.	Sebastian Rasmussen
	Previously out of range phase values were accepted which led to overly long loops when processing moveto. This could be triggered e.g. by 2222222222222222222 [ 4 6 ] 0 d in a content stream.
2015-07-27	Cosmetic rearrangement of code for drawing paths.	Sebastian Rasmussen
	In preparation for a bugfix concerning path phase.
2015-07-26	Ignore excessive output from PDF functions.	Sebastian Rasmussen
	Previously all output values were used, causing buffer overflows. Fixes one issue from bug 696012.
2015-07-20	Only include gproof document handler if SUPPORT_GPROOF is defined.	Robin Watts

2015-07-20	Improve Grid fitting of images for .gproof files.	Robin Watts
	By default in MuPDF, when we render an axis aligned image, we 'gridfit' it. This is a heuristic used to improve the rendering of tiled images, and avoid the background showing through on the antialiased edges. The general algorithm we use is to expand any image outwards so that it completely covers any pixels that it touches any part of. This is 'safe' in that we never cause any pixels to not be covered that should otherwise be so, and is important when we have images that are aligned with (say) line art rectangles. For gproof files though, this gives nasty results - because we have multiple images tiled across the page all exactly abutting, in most cases the edges will not be on exact integer coordinates. This means we expand both images and 1 (destination) pixel is lost. This severely hurts the rendering (in particular on text based pages). We therefore introduce a new type of grid fitting, where we simply align the edges of images to the closest integer pixel. This is safe because we know that neighbouring images will be adjusted identically and edges will stay coincident. We enable/disable this behaviour through a new device flag, and make the gproof interpreter set/clear this flag when generating the page - thus normal rendering is unaffected. We could have just poked the dev->flags fields directly, but that would require magic in the display list device to check for them being set/unset and to poke the dev->flags fields on playback, so instead we introduce a new fz_render_flags function (that calls a device function) to set/unset flags. The other attraction of this is that if we ever have devices that 'filter', we can neatly handle passing flag changes on with those. Currently the display list implementation only copes with set/clear of the FZ_DEVFLAG_GRIDFIT_AS_TILED option. We only readily have 6 bits available to us, so we'll just extend this as required if we add new render flags.
2015-07-20	First cut at gprf document handler.	Robin Watts
	Doesn't actually trigger generation from ghostscript, or load images from files generated by ghostscript yet.
2015-07-20	Tweak fz_tempfile to include directory hint.	Robin Watts
	In android, we can't write to '.', and we don't have TMP defined. Therefore use the path of the supplied file as a hint.
2015-07-20	Fix leak during text extraction.	Robin Watts
	MuPDF (the win32/linux viewer) leaks a span_soup each time it is run, even if (seemingly to the user) no text extraction operations are done. This is because the view does a text extraction pass silently, during which 'begin_page' is called for both page contents and annotation contents. This causes a leak of a span_soup. Change the implementation to allocate the span_soup just in time instead.
2015-07-20	Enable fz_images to have NULL buffers, and still be decoded.	Robin Watts
	Important for gproof files.
2015-07-20	Fix leak in separations code.	Robin Watts
	Include code to free the list of separation names.
2015-06-29	Add Separation class to fitz.	Robin Watts
	Simple set of functions for managing sets of separations. Separations have names, equivalent rgb/cmyk colors, and can be enabled/disabled.
2015-06-29	Further tweaks to fz_image handling.	Robin Watts
	Ensure that subsampling and caching happen in the generic image code, not in the specific. Previously, the subsampling happened only for images that were decoded from streams. Images that were loaded direct were never subsampled and hence were always cached at full size. After this change both classes of image are correctly subsampled, and the subsampled version kept in the cache. This produces various image diffs in the cluster, none of which are noticable to the naked eye.
2015-06-29	Rejig the internals of fz_image slightly.	Robin Watts
	Previously, we had people calling image->get_pixmap directly. Now we have them all call fz_image_get_pixmap, which will look for a cached version in the store, and only call get_pixmap if required. Previously fz_image_get_pixmap used to look for the cached version in the store, and decode if not - hence the decoding code is now extracted out into standard_image_get_pixmap. This was the original intent of the code, it just somehow didn't end up like that. This nicely queues us up for being able to have fz_images that use a different get_pixel implementation, such as that which will be required for the gprf code.
2015-06-29	Add an fz_tempfile utility.	Robin Watts
	This will be required for the gprf work.
2015-06-26	Add stream functions for reading LE values of different sizes	Robin Watts
	fz_read_int16le, fz_read_int32le, fz_read_int64le.
2015-06-26	Bug 696053: Update windows mupdf to respect command line flags.	Robin Watts
	Previously, only the unix executable had been updated to take command line flags; update the windows one in line with it. We have to cope with the argv being in Unicode; add a windows specific version of getoptw for this. Also note that that fprintf's in the windows mupdf exe won't work as GUI apps don't have a console window, and can't write to the parent one. Fixing that is a larger project than I have time for right now.
2015-05-25	Style context reference should be 1 after creation	Sebastian Rasmussen

2015-05-19	Add locks to fz_set_device_xxx colorspace context functions.	Tor Andersson

2015-05-19	epub: User stylesheets.	Tor Andersson
	Add -U option to mupdf and mudraw to set a user stylesheet. Uses a context to store user the stylesheet, just like the AA level.
2015-05-15	Memento improvements.	Robin Watts
	Firstly, when displaying a list of nested blocks, don't suppress outputting a block just because it contains a pointer to itself. Various valgrind fixes from the gs version of memento. Experimental C++ operators. See writeup in memento.h comments for how to integrate.
2015-05-15	Support pdf files larger than 2Gig.	Robin Watts
	If FZ_LARGEFILE is defined when building, MuPDF uses 64bit offsets for files; this allows us to open streams larger than 2Gig. The downsides to this are that: * The xref entries are larger. * All PDF ints are held as 64bit things rather than 32bit things (to cope with /Prev entries, hint stream offsets etc). * All file positions are stored as 64bits rather than 32. The implementation works by detecting FZ_LARGEFILE. Some #ifdeffery in fitz/system.h sets fz_off_t to either int or int64_t as appropriate, and sets defines for fz_fopen, fz_fseek, fz_ftell etc as required. These call the fseeko64 etc functions on linux (and so define _LARGEFILE64_SOURCE) and the explicit 64bit functions on windows.
2015-05-14	Move away from file descriptors to FILE *'s.	Robin Watts

2015-05-05	Ignore ENTITY declarations in XML.	Tor Andersson

2015-05-05	epub: Decode URL escapes in epub paths.	Tor Andersson

2015-05-05	Fix typo in fz_pack_path that caused us to malloc much more than needed.	Tor Andersson

2015-04-14	Split fz_meta into separate querying functions.	Tor Andersson
	Add fz_has_permission function to fz_document. Add fz_lookup_metadata function to fz_document. Remove fz_meta function from fz_document.
2015-04-09	Remove the _no_run functions.	Tor Andersson
	The new pdfclean sanitize functionality mean that mutool now needs the data files, so maintaining the split that was designed to keep data files out of mutool is no longer viable.
2015-04-07	Fix whitespace.	Tor Andersson

2015-04-07	Fix some warnings.	Tor Andersson

2015-04-07	Trigger default layout in fz_document layer.	Tor Andersson
	Trigger the default layout when needed, but only if no manual layout has been done. This avoids doing a pointless double layout (once with default when loading the document, then with the manual layout call with the desired layout options).
2015-04-07	Fix structured text extraction in vertical mode.	Robin Watts
	When advancing a glyph in vertical mode, it should advance down the page. The origin of the glyph as supplied is bottom left, not top right - allow for this in calculations. Previously glyphs were not being collated into spans because of this.
2015-04-07	Structured text extraction; improve glyph bounding box calculations.	Robin Watts
	In vertical motion mode, when calculating bboxes we should use horizontal rather vertical displacements from the 'axis of movement'. In horizontal mode, we displace by 'ascender' and 'descender'. Those concepts don't rotate with the motion mode, so repurpose those fields to hold bbox.x0 and bbox.x1 in vertical mode.
2015-04-07	Use fz_advance_glyph rather than direct FT calls during PDF layout.	Robin Watts

2015-04-06	Add some simple debug code to dump fz_gels.	Robin Watts
	Just for internal use, no external interface.
2015-04-06	Antidropout followup: cope with stroking too.	Robin Watts
	Stroke segments that are horizontal or vertical get the same antidropout treatment as filled rectangles.
2015-04-06	Bug 694367: Attempt to avoid dropouts of rectangles.	Robin Watts
	This is not a complete general fix for features dropping out of rendered line art, but merely a fix for one of the more common cases. When rendering rectangles (currently, specifically only those rectangles that are actually defined as rectangles within the path structure), if they are axis aligned, then ensure that they always fill the subpixel line they are on.
2015-04-01	Clean up mudraw command line syntax.	Tor Andersson
	... and move outline printing to mutool show.
2015-03-30	Bug 695549: Avoid returning compressed buffer as uncompressed.	Robin Watts
	pdf_load_image_stream is supposed to return a buffer containing the uncompressed stream from an object (or, in the case of image streams where an fz_compression_params structure is supplied, a stream decompressed up to the point of the image format compression). We have an optimisation in pdf_load_image_stream to allow it to return the existing buffer from a cached object rather than reloading it again, but as bug 695549 points out, this breaks in the case where the cached stream is compressed. The suggested fix by the bug reporter (Stefan Klein) would work in that it would stop compressed streams being returned as uncompressed ones, but it is not perfect as it could lead to several copies of shortstoppable image streams being loaded (and for streams with null or empty array filters being mistaken for compressed ones). The fix here solves these cases too.
2015-03-30	Bug 695556: Use doubles when inverting matrices.	Robin Watts
	When inverting matrices, use doubles for inversion calculations. This prevents floats over/underflowing and causing stroked content to go missing.
2015-03-26	Avoid division by 0 in blend calls.	Robin Watts
	If b is out of range (-ve), then this can let s == 0 and we can get failures.
2015-03-25	Fix problems with fz_printf.	Robin Watts
	My recent changes to support %010Zd were broken in many interesting ways. Fixed here.
2015-03-24	Update our printf to cope with various useful extensions.	Robin Watts
	Ensure that %010d works. Ensure that we can output 64 bit values (%ll{d,u,x}). Ensure that we can output size_t and fz_off_t (%z{d,u,x} and %Z{d,u,x}). fz_off_t isn't defined yet (it will be introduced by a commit that depends on this one), so for now, we put a stub definition in printf.c that we will remove later.
2015-03-24	Path packing for memory efficiency.	Robin Watts
	Introduce the concept of 'packed' paths. These reduce the header overhead for most common paths (ones with less than 256 commands and 256 coords) to a single 32bit int once stored in the display list. The previous commit reduces the torture-test.pdf from 95 to 87Meg. This commit futher reduces it to 70Meg.
2015-03-24	Path rework for improved memory usage.	Robin Watts
	Firstly, we make the definition of the path structures local to path.c. This is achieved by using an fz_path_processor function to step through paths enumerating each section using callback functions. Next, we extend the internal path representation to include other section types, including quads, beziers with common control points rectangles, horizontal, vertical and degenerate lines. We also roll close path sections up into the previous sections commands. The hairiest part of this is that fz_transform_path has to cope with changing the path commands depending on the matrix. This is a relatively rare operation though.
2015-03-24	Don't pass interpreter context to pdf_processor opcode callbacks.	Tor Andersson
	Update buffer and filter processors. Filter both colors and stroke states. Move OCG hiding logic into interpreter.
2015-03-20	Workaround bug in ftoa.	Tor Andersson