mupdf - MuPDF PDF reader and library

Age	Commit message (Collapse)	Author
2012-09-04	Fix error message/comment typos.	Sebastian Rasmussen

2012-09-04	Prevent recursive image soft masks from being loaded	Sebastian Rasmussen
	Print a warning instead and ignore the soft mask
2012-08-06	Remove old error mesages turned into comments when adding exceptions	Sebastian Rasmussen

2012-08-06	Make sure images are freed upon loading error	Sebastian Rasmussen
	There was a call to fz_drop_image() in the error handling code, however the fz_storable freeing function was not yet set which meant that the image was not freed. Instead call pdf_free_image() to free the image.
2012-07-06	Improve bounds checking for image properties.	Sebastian Rasmussen

2012-07-06	Remove debugging functions for release builds.	Sebastian Rasmussen

2012-05-10	mupdfclean - update to allow renumbering of encrypted objects	Robin Watts
	mupdfclean (or more correctly, the pdf_write function) currently has a limitation, in that we cannot renumber objects when encryption is being used. This is because the object/generation number is pickled into the stream, and renumbering the object causes it to become unreadable. The solution used here is to provide extended functions that take both the object/generation number and the original object/generation number. The original object numbers are only used for setting up the encryption. pdf_write now keeps track of the original object/generation number for each object. This fix is important, if we ever want to output linearized pdf as this requires us to be able to renumber objects to a very specific order. We also make a fix in removeduplicateobjects that should only matter in the case where we fail to read an object correctly.
2012-05-03	Bug 693022: Image/Pixmaps stuck in cache	Robin Watts
	Zeniko points out that images that don't decode on demand (i.e. ones that are held as pixmaps all the time) can never be evicted from the cache due to them holding a pointer to the pixmap, which holds a pointer back to the image. His fix is to only cache images that decode on demand. The actual patch applied here is slightly tweaked from his version; firstly the 'dontcache' logic is reversed to 'cache' to avoid overloading my poor brain with another negation. Secondly, one change to a condition is not taken on, as it is (I believe) unnecessary.
2012-03-19	Bug 692746; avoid 'double palettes' on jpx images.	Robin Watts
	It seems that JPX images can be supplied in indexed format, with both a palette internal to the jpx stream, and a palette in the PDF. Googling seems to suggest that the internal palette should be ignored in this case, and the external palette applied. Fortunately, since OpenJPEG-1.5 there is a flag that can be used to tell OpenJPEG not to decode palettes. We update the code here to spot that there is an external palette, and to set this flag.
2012-03-06	Split fitz.h/mupdf.h into internal/external headers.	Robin Watts
	Attempt to separate public API from internal functions.
2012-03-06	Fix ref counting bugs in race condition correction code.	Robin Watts
	When we attempt to insert a key/value pair into the store, we have to allow for the possibility that a racing thread may have already inserted an equivalent key/value. We have special code in place to handle this eventuality; if we spot an existing entry, we take the existing one in preference to our new key/value pair. This means that fz_store_item needs to take a new reference to any existing thing it finds before returning it. Currently the only store user that is exposed to this possibility is pdf_image; it spots an existing tile being returned, and was inadvertently double freeing the key.
2012-03-01	Remove mask entry from fz_pixmap as never used any more.	Robin Watts
	Also, the attempts to keep it up to date were causing race conditions in multithreading cases.
2012-02-26	Move fz_obj to be pdf_obj.	Robin Watts
	Currently, we are in the slightly strange position of having the PDF specific object types as part of fitz. Here we pull them out into the pdf layer instead. This has been made possible by the recent changes to make the store no longer be tied to having fz_obj's as keys. Most of this work is a simple huge rename; to help customers who may have code that use such functions we have provided a sed script to do the renaming; scripts/rename2.sed. Various other small tweaks are required; the store used to have some debugging code that still required knowledge of fz_obj types - we extract that into a nicer 'type' based function pointer. Also, the type 3 font handling used to have an fz_obj pointer for type 3 resources, and therefore needed to know how to free this; this has become a void * with a function to free it.
2012-02-25	Fix assert/SEGVs seen in cluster due to using a color image as a mask.	Robin Watts
	When loading a JPX image with no specified colorspace, we were ending with image->colorspace being set to NULL. This caused us to treat the image as a mask. The correct fix is to inherit the colorspace from the jpx once loaded.
2012-02-25	Rework image handling for on demand decode	Robin Watts
	Introduce a new 'fz_image' type; this type contains rudimentary information about images (such as native, size, colorspace etc) and a function to call to get a pixmap of that image (with a size hint). Instead of passing pixmaps through the device interface (and holding pixmaps in the display list) we now pass images instead. The rendering routines therefore call fz_image_to_pixmap to get pixmaps to render, and fz_pixmap_drop those afterwards. The file format handling routines therefore need to produce images rather than pixmaps; xps and cbz currently just wrap pixmaps as images. PDF is more involved. The stream handling routines in PDF have been altered so that they can recognise when the last stream entry in a filter dictionary is an image decoding filter. Rather than applying this filter, they read and store the parameters into a pdf_image_params structure, and stop decoding at that point. This allows us to read the compressed data for an image into memory as a block. We can then restart the image decode process later. pdf_images therefore consist of the compressed image data for images. When a pixmap is requested for such an image, the code checks to see if we have one (of an appropriate size), and if not, decodes it. The size hint is used to determine whether it is possible to subsample the image; currently this is only supported for JPEGs, but we could add generic subsampling code later. In order to handle caching the produced images, various changes have been made to the store and the underlying hash table. Previously the store was indexed purely by fz_obj keys; we don't have an fz_obj key any more, so have extended the store by adding a concept of a key 'type'. A key type is a pointer to a set of functions that keep/drop/compare and make a hashable key from a key pointer. We make a pdf_store.c file that contains functions to offer the existing fz_obj based functions, and add a new 'type' for keys (based on the fz_image handle, and the subsample factor) in the pdf_image.c file. While working on this, a problem became apparent in the existing store codel; fz_obj objects had no protection on their reference counts, hence an interpreter thread could try to alter a ref count at the same time as a malloc caused an eviction from the store. This has been solved by using the alloc lock as protection. This in turn requires some tweaks to the code to make sure we don't try and keep/drop fz_obj's from the store code while the alloc lock is held. A side effect of this work is that when a hash table is created, we inform it what lock should be used to protect its innards (if any). If the alloc lock is used, the insert method knows to drop/retake it to allow it to safely expand the hash table. Callers to the hash functions have the responsibility of taking/dropping the appropriate lock, and ensuring that they cope with the possibility that insert might drop the alloc lock, causing race conditions.
2012-02-06	Pass context to cmap and font descriptor functions.	Tor Andersson

2012-02-03	Be consistent about passing a fz_context argument in pixmap functions.	Tor Andersson

2012-01-30	Move PNG, JPEG and TIFF image loading from muxps into fitz.	Tor Andersson

2012-01-27	Rename pdf_xref type to pdf_document.	Tor Andersson

2012-01-06	Ignore decode arrays for indexed jpx images.	Robin Watts
	We can't do them properly at the moment, so don't try to do them at all - only makes stuff worse for now. Thanks to Zeniko for this.
2011-12-23	Add some fz_vars to fix exception behaviour.	Robin Watts
	gcc 4.4.5 gives helpful warnings about variables that can become unset due to setjmp/longjmp usage. Fix that here. Thanks to Sebras.
2011-12-19	More Memory squeezing fixes	Robin Watts

2011-12-15	Remove 'soft limit' on pixmaps in favour of fz_store.	Robin Watts
	Change the fz_store to be limited to 256 Megs. Remove the soft limit for pixmaps; the store will automatically throw old resources away to stay below the limit.
2011-12-15	Rework pdf_store to fz_store, a part of fz_context.	Robin Watts
	Firstly, we rename pdf_store to fz_store, reflecting the fact that there are no pdf specific dependencies on it. Next, we rework it so that all the objects that can be stored in the store start with an fz_storable structure. This consists of a reference count, and a function used to free the object when the reference count reaches zero. All the keep/drop functions are then reimplemented by calling fz_keep_sharable/fz_drop_sharable. The 'drop' functions as supplied by the callers are thus now 'free' functions, only called if the reference count drops to 0. The store changes to keep all the items in the store in the linked list (which becomes a doubly linked one). We still make use of the hashtable to index into this list quickly, but we now have the objects in an LRU ordering within the list. Every object is put into the store, with a size record; this is an estimate of how much memory would be freed by freeing that object. The store is moved into the context and given a maximum size; when new things are inserted into the store, care is taken to ensure that we do not expand beyond this size. We evict any stored items (that are not in use) starting from the least recently used. Finding an object in the store now takes a reference to it already. LOCK and UNLOCK comments are used to indicate where locks need to be taken and released to ensure thread safety.
2011-12-08	Remove remaining fz_error_note calls in the pdf code.	Tor Andersson

2011-12-08	Move from volatile to fz_var.	Robin Watts
	When using exceptions (which are implemented using setjmp/longjmp), we need to be careful to ensure that variable values get written back before any exception happens. Previously we've done that using volatile, but that produces nasty warnings (and unduly limits the compilers freedom to optimise). Here we introduce a new macro fz_var that passes the address of the variable out of scope. This means that the compiler has to ensure that any changes to its value are written back to memory before calling any out of scope function.
2011-12-08	Fix SEGV with "1439 - color softmask fails to draw jpx image.pdf"	Robin Watts
	Another missed fz_rethrow. Also, ensure that fz_drop_buffer copes with NULL input.
2011-12-08	Merge branch 'master' into context	Robin Watts

2011-12-08	Ignore invalid color key Masks that contain non-integer values.	Tor Andersson

2011-11-25	Merge branch 'master' into context	Robin Watts

2011-11-17	Bug 692478: Honour decode arrays for jpx images	Robin Watts
	Add simple code to read decode array and apply it to a jpx image after loading. Solves bug.
2011-10-04	Establish 'break scope' within fz_try macro	Robin Watts
	It is vital that no one returns from within fz_try. As such it's often necessary to jump out of an fz_try. This can mean using a label at the end of the fz_try section to goto. By introducing a "do { } while (0)" around the contents of the fz_try we allow people to 'break' (or 'continue') to get out neatly.
2011-10-04	Move to exception handling rather than error passing throughout.	Robin Watts
	This frees us from passing errors back everywhere, and hence enables us to pass results back as return values. Rather than having to explicitly check for errors everywhere and bubble them, we now allow exception handling to do the work for us; the downside to this is that we no longer emit as much debugging information as we did before (though this could be put back in). For now, the debugging information we have lost has been retained in comments with 'RJW:' at the start. This code needs fuller testing, but is being committed as a work in progress.
2011-09-21	Add warning context.	Tor Andersson

2011-09-21	Rename malloc functions for arrays (fz_calloc and fz_realloc).	Tor Andersson

2011-09-21	Don't thread ctx through safe fz_obj functions.	Tor Andersson

2011-09-15	Add context to mupdf.	Robin Watts
	Huge pervasive change to lots of files, adding a context for exception handling and allocation. In time we'll move more statics into there. Also fix some for(i = 0; i < function(...); i++) calls.
2011-09-14	Initial import of exception handling code	Robin Watts
	Import exception handling code from WSS, modified to fit into the fitz world. With this code we have 'real' fz_try/fz_catch/fz_rethrow functions, handling a fz_except type. We therefore rename the existing fz_throw/ fz_catch/fz_rethrow to be fz_error_make/fz_error_handle/fz_error_note. We don't actually use fz_try/fz_catch/fz_rethrow yet...
2011-08-29	Check for and disallow image depths greater than 16.	Tor Andersson

2011-04-08	Add soft limit to pixmap allocation.	Tor Andersson
	All image loading functions call the new fz_new_pixmap_with_limit allocation function, which will return NULL if the total amount of pixmap memory would exceed a set limit. Other vital pixmap allocations which are not as easily recoverable (such as font rendering, and the various buffers in the draw device) ignore the limit.
2011-04-08	Simplify arguments to fz_new_pixmap.	Tor Andersson

2011-04-07	pdf: Purge unmaintained debug/log printing messages.	Tor Andersson

2011-04-06	Improve handling of JPX colorspaces.	Tor Andersson

2011-04-04	Le Roi est mort, vive le Roi!	Tor Andersson
	The run-together words are dead! Long live the underscores! The postscript inspired naming convention of using all run-together words has served us well, but it is now time for more readable code. In this commit I have also added the sed script, rename.sed, that I used to convert the source. Use it on your patches and application code.
2011-04-04	pdf: Rename mupdf directory.	Tor Andersson