mupdf - MuPDF PDF reader and library

Age	Commit message	Author
2012-02-11	Purge unused and bit rotted fz_accelerate stuff.	Tor Andersson

2012-02-08	Lock reworking.	Robin Watts
	This is a significant change to the use of locks in MuPDF. Previously, the user had the option of passing us lock/unlock functions for a single mutex as part of the allocation struct. Now we remove these entries from the allocation struct, and make a separate 'locks' struct. This enables people to use fz_alloc_default with locking.
	If multithreaded operation is required, then the user is required to create FZ_LOCK_MAX mutexes, which will be locked or unlocked by MuPDF calling the lock/unlock functions within the new fz_locks_context structure passed in at context creation. These mutexes are not required to be recursive (they may be, but MuPDF should never call them in this way).
	MuPDF avoids deadlocks by imposing a locking ordering on itself; a thread will never take lock n, if it already holds any lock i for which 0 <= i <= n.
	Currently, there are 4 locks used within MuPDF.
	Lock 0: The alloc lock; taken around all calls to user supplied (or default) allocation functions. Also taken around all accesses to the refs field of storable items.
	Lock 1: The store lock; taken whenever the store data structures (specifically the linked list pointers) are accessed.
	Lock 2: The file lock; taken whenever a thread is accessing the raw file. We use the debugging macros to insist that this is held whenever we do a file based seek or read. We also insist that this is never held when we resolve an indirect reference, as this can have the effect of moving the file pointer.
	Lock 3: The glyphcache lock; taken whenever a thread calls freetype, or accesses the glyphcache data structures. This introduces some complexities w.r.t type3 fonts.
	Locking can be hugely problematic, so to ease our minds as to the correctness of this code, we introduce some debugging macros. These compile away to nothing unless FITZ_DEBUG_LOCKING is defined. fz_assert_lock_held(ctx, lock) checks that we hold lock. fz_assert_lock_not_held(ctx, lock) checks that we do not hold lock. In addition fz_lock_debug_lock and fz_lock_debug_unlock are used on every fz_lock/fz_unlock to check the validity of the operation we are performing - in particular it checks that we do/do not already hold the lock we are trying to take/drop, and that by taking this lock we are not violating our defined locking order.
	The RESOLVE macro (used throughout the code to check whether we need to resolve an indirect reference) calls fz_assert_lock_not_held to ensure that we aren't about to resolve an indirect reference (and hence move the stream pointer) when the file is locked.
	In order to implement the file locking properly, pdf_open_stream (and friends) now lock the file as a side effect (because they fz_seek to the start of the stream). The lock is automatically dropped on an fz_close of such streams.
	Previously, the glyph cache was created in a context when it was first required; this presents problems as it can be shared between several contexts or not, depending on whether it is created before the contexts are cloned. We now always create it at startup, so it is always shared. This means that we need reference counting for the glyph caches. Added here.
	In fz_render_glyph, we take the glyph cache lock, and check to see whether the glyph is in the cache. If it is, we bump the refcount, drop the lock and return the cached character. If it is not, we need to render the character. For freetype based fonts we keep the lock throughout the rendering process, thus ensuring that freetype is only called in a single threaded manner. For type3 fonts, however, we need to invoke the interpreter again to render the glyph streams. This can require reentrance to this routine. We therefore drop the glyph cache lock, call the interpreter to render us our pixmap, and take the lock again.
	This dropping and retaking of the lock introduces a possible race condition; two threads may try to render the same character at the same time. We therefore modify our hash table insert routines to behave differently if it comes to insert an entry only to find that an entry with the same key is already there. We spot this case; if we have just rendered a type3 glyph and when we try to insert it into the cache discover that someone has beaten us to it, we just discard our entry and use the cached one. Hopefully this will seldom be a problem in practice; to solve it properly would require greater complexity (probably involving spotting that another thread is already working on the desired rendering, and sleeping on a semaphore until it completes).
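The lock ordering rule described above can be illustrated with a small checker. This is only a sketch under stated assumptions, not the code from the commit: the enum names, the `held` array and the `debug_lock`/`debug_unlock` helpers are hypothetical, and the bookkeeping is for a single thread, whereas a real checker would track held locks per thread.

```c
#include <assert.h>

/* Hypothetical names for the four locks listed in the commit message. */
enum { LOCK_ALLOC = 0, LOCK_STORE, LOCK_FILE, LOCK_GLYPHCACHE, LOCK_MAX };

/* Locks held by the (single) thread in this sketch. */
static int held[LOCK_MAX];

/* Returns 1 if taking `lock` respects the ordering rule: never take
 * lock n while already holding any lock i with 0 <= i <= n. */
static int may_take(int lock)
{
	int i;
	assert(lock >= 0 && lock < LOCK_MAX);
	for (i = 0; i <= lock; i++)
		if (held[i])
			return 0;
	return 1;
}

/* Record a lock as taken; returns 0 on an ordering violation
 * or on an attempt to take a lock that is already held. */
static int debug_lock(int lock)
{
	if (!may_take(lock))
		return 0;
	held[lock] = 1;
	return 1;
}

/* Record a lock as dropped; returns 0 if it was not held. */
static int debug_unlock(int lock)
{
	assert(lock >= 0 && lock < LOCK_MAX);
	if (!held[lock])
		return 0;
	held[lock] = 0;
	return 1;
}
```

With this rule, a thread holding the glyphcache lock (3) may still take the alloc lock (0), but a thread holding the alloc lock may not take any other lock, so locks are always acquired in decreasing numerical order.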
2012-02-07	Update windows viewer to latest changes.	Tor Andersson

2012-02-07	Rename a few functions.	Tor Andersson

2012-02-06	Fix 692841: Look at ConfigureNotify events while waiting for MapNotify.	Tor Andersson
	We used to discard all events until we got a MapNotify, but some window managers send the ConfigureNotify before the window is mapped.
2012-02-03	Free current document page when closing pdf application.	Sebastian Rasmussen

2012-02-03	Be consistent about passing a fz_context in path/text/shade functions.	Tor Andersson

2012-02-03	Be consistent about passing a fz_context argument in pixmap functions.	Tor Andersson

2012-02-03	Reference count fz_link objects.	Tor Andersson

2012-02-03	Use the new document interface in viewer.	Tor Andersson

2012-02-03	Add document interface.	Tor Andersson

2012-02-03	Allow ZIP as extension for CBZ files.	Tor Andersson

2012-02-02	Support remote links in XPS documents.	Robin Watts
	Update xps path handling to cope with URLs. Fix premature freeing of links. Spot remote URLs and use appropriate link type.
2012-02-02	Work on supporting links in xps documents.	Robin Watts
	Currently, this only works with local links. When running the page, check for NavigateUri entries; if found, and that page is not already marked as having resolved its links, add a new link entry to doc->current_page links. When the page finishes running, mark the page as having resolved its links. This avoids the links being generated multiple times. Update the mupdf viewer to use these links - but only AFTER the page has been run.
2012-01-31	Make pdfclean more resilient to errors while parsing.	Robin Watts
	Just add some fz_try/fz_catches.
2012-01-30	Do not embed a context in the fz_outline structure.	Tor Andersson

2012-01-30	Add CBZ (comic book zip-file) parser.	Tor Andersson

2012-01-27	Android/Windows build fixes	Robin Watts
	Update Android build for new thirdparty.zip. Small windows fix for pdf_xref -> pdf_document changes.
2012-01-27	Rename pdfdraw to mupdfdraw etc.	Robin Watts
	This a) improves our branding, and b) avoids conflicts with other pdf tools out there (pdfinfo etc).
2012-01-27	Rename pdf_xref type to pdf_document.	Tor Andersson

2012-01-19	Remove confusing optional 'password' argument to pdf_open_xref.	Tor Andersson
	Require that clients call pdf_needs_password/pdf_authenticate_password instead. For dumb clients, we still allow for decrypting a file with a blank password without calling those functions.
2012-01-18	Add fullscreen mode to mupdf viewer.	Tor Andersson
	Toggle with 'f'. Fullscreen turns off shrinkwrap, and shrinkwrap turns off fullscreen.
2012-01-12	Update copyright notices for 2012.	Tor Andersson

2012-01-12	Use the same coordinate system for pdf and xps pages in the interface.	Tor Andersson
	Move coordinate space tweaks into pdf_ and xps_run_page, and provide neutral pdf_ and xps_bound_page functions to return the page size as a zero-origined bounding box.
2012-01-11	Add xps_run_page function.	Tor Andersson

2012-01-11	Use enum for FZ_STORE_DEFAULT default size.	Tor Andersson

2012-01-11	Hide glyph cache in context.	Tor Andersson

2012-01-11	Set default values for alloc context and max store size if none are given.	Sebastian Rasmussen

2012-01-11	Do not unnecessarily use O_BINARY in X11 viewer.	Sebastian Rasmussen

2012-01-10	Automatically load page tree when accessing a page/page count.	Sebastian Rasmussen

2012-01-10	Fix many spelling errors.	Sebastian Rasmussen

2012-01-09	Add fz_try/fz_catch to xpsdraw	Robin Watts
	Avoid unhandled exceptions. Thanks to Zeniko for this.
2012-01-06	pdfshow; cope better with broken objects	Robin Watts
	In pdfshow, if we fail to parse an object, just skip it rather than aborting. Thanks to Zeniko for the suggestion.
2012-01-06	pdfclean; trailer dictionary expansion fix	Robin Watts
	The logic controlling whether to expand a trailer dictionary or not was reversed. Fixed here. Thanks to Zeniko for pointing this out.
2012-01-04	Bug 692739: Add ability to abort time consuming actions	Robin Watts
	A new 'cookie' parameter is added to page rendering/interpretation functions. Supply this as NULL to get existing behaviour. If you supply a non-NULL cookie, then this is taken as a pointer to a struct that can be used for simple, non-thread locked communication between caller and library. The entire struct should be memset to zero before entry, except for specific flags (thus coping with future extensions to this struct). The abort flag should be zero on entry. It will be checked periodically by the library - if the caller sets it non-zero (via another thread) then the current operation will be aborted. No guarantees are given as to how often this will be checked, or how fast it will be responded to. The progress_max field will be set to an integer (-1 for unknown) representing the number of 'things' to do. The progress field will count up from 0 to this number as time goes by. No guarantees are made as to the accuracy of this information, but it should be useful for offering some sort of progress bar etc. Note that progress_max may increase during the job. In general, callers should be careful to accept out of range or invalid data in this structure as this is deliberately accessed 'unlocked'.
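The cookie protocol described above might look roughly like the following. This is a hedged sketch, not the library's actual structure or functions: `cookie_t`, `cookie_reset`, `run_items` and the `step_fn` callback are hypothetical names, and the callback stands in for a second thread setting the abort flag so that the example stays single-threaded.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical cookie layout; field names follow the commit message. */
typedef struct {
	volatile int abort;  /* caller sets this non-zero to request a stop */
	int progress;        /* counts up from 0 */
	int progress_max;    /* number of 'things' to do, -1 if unknown */
} cookie_t;

/* The whole struct is zeroed before it is handed to the library. */
static void cookie_reset(cookie_t *cookie)
{
	memset(cookie, 0, sizeof *cookie);
}

/* Hypothetical per-item hook, used here to simulate another thread. */
typedef void (*step_fn)(cookie_t *cookie, int item);

/* Process `count` items, polling the abort flag before each one.
 * A NULL cookie gives the old, uninterruptible behaviour.
 * Returns the number of items actually processed. */
static int run_items(int count, cookie_t *cookie, step_fn step)
{
	int i;
	assert(count >= 0);
	if (cookie)
		cookie->progress_max = count;
	for (i = 0; i < count; i++) {
		if (cookie && cookie->abort)
			break;
		if (step)
			step(cookie, i);
		if (cookie)
			cookie->progress = i + 1;
	}
	return i;
}

/* Simulated caller: asks for an abort while the third item runs. */
static void abort_after_third(cookie_t *cookie, int item)
{
	if (cookie && item == 2)
		cookie->abort = 1;
}
```

Because the flag is only polled between items, an abort requested during an item takes effect before the next one starts, which matches the "no guarantees on how fast" wording above.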
2012-01-03	Add mubusy build	Robin Watts
	Add simple combined exe build for mupdf/muxps tools.
2011-12-30	Load outlines in viewer after pages to allow links to work.	Robin Watts
	In order for hyperlinks to work, we need to load the outlines after the page tree.
2011-12-28	Outline/link destination tweaks.	Robin Watts
	Move 'kind' into the fz_link_dest structure (as this makes more sense). Put an fz_link_dest rather than just a page number into the outlines structure. Correct parsing of actions and dests from pdf outlines.
2011-12-23	Generalise pdf_links to be fz_links.	Robin Watts
	Move to a non-pdf specific type for links. PDF specific parsing is done in pdf_annots.c as before, but the essential type (and handling functions for that type) are in a new file fitz/base_link.c. The new type is more expressive than before; specifically all the possible PDF modes are expressible in it. Hopefully this should allow XPS links to be represented too.
2011-12-17	Memory squeezing fix	Robin Watts

2011-12-16	Another memsqueezing fix.	Robin Watts

2011-12-16	More memsqueezing fixes	Robin Watts

2011-12-15	Various Memsqueezing fixes.	Robin Watts
	Fixes for leaks (and SEGVs, divisions by zero etc) seen when Memsqueezing.
2011-12-15	Fix warnings/errors on unix builds.	Robin Watts
	Fix warnings/errors thrown up by the last few commits (which were only tested on windows).
2011-12-15	Add scavenging functionality.	Robin Watts
	When fz_malloc (etc) are about to fail, we try to scavenge memory from the store and then retry. We repeatedly try to bin objects from the store until the malloc succeeds, or until we have nothing else to bin. This means we no longer need the 'aging' of the store, so this is removed.
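The scavenge-and-retry loop described above can be sketched as follows. Everything here is an assumption made for illustration: the toy store, the simulated memory budget and the names `limited_malloc`, `store_put`, `store_scavenge_one` and `scavenging_malloc` are hypothetical and are not the functions added by the commit.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical toy store: cached blocks that may be thrown away. */
enum { STORE_SLOTS = 8 };
static void *store_items[STORE_SLOTS];
static size_t store_sizes[STORE_SLOTS];
static int store_count;

/* Simulated memory budget, so that failure can be provoked on demand. */
static size_t budget_left = 100;

static void *limited_malloc(size_t size)
{
	void *p;
	if (size > budget_left)
		return NULL;
	p = malloc(size);
	if (p)
		budget_left -= size;
	return p;
}

static void limited_free(void *p, size_t size)
{
	free(p);
	budget_left += size;
}

/* Hand a block to the store; returns 0 if the store has no free slot. */
static int store_put(void *p, size_t size)
{
	assert(p != NULL);
	if (store_count == STORE_SLOTS)
		return 0;
	store_items[store_count] = p;
	store_sizes[store_count] = size;
	store_count++;
	return 1;
}

/* Bin one stored item; returns 0 when there is nothing left to bin. */
static int store_scavenge_one(void)
{
	if (store_count == 0)
		return 0;
	store_count--;
	limited_free(store_items[store_count], store_sizes[store_count]);
	return 1;
}

/* Allocate; on failure, bin store items one at a time and retry
 * until the allocation succeeds or the store is empty. */
static void *scavenging_malloc(size_t size)
{
	for (;;) {
		void *p = limited_malloc(size);
		if (p != NULL)
			return p;
		if (!store_scavenge_one())
			return NULL;
	}
}
```

The design point is that eviction happens only on demand, at the moment an allocation would otherwise fail, which is why periodic 'aging' of the store becomes unnecessary.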
2011-12-15	Remove 'soft limit' on pixmaps in favour of fz_store.	Robin Watts
	Change the fz_store to be limited to 256 Megs. Remove the soft limit for pixmaps; the store will automatically throw old resources away to stay below the limit.
2011-12-15	Rework pdf_store to fz_store, a part of fz_context.	Robin Watts
	Firstly, we rename pdf_store to fz_store, reflecting the fact that there are no pdf specific dependencies on it. Next, we rework it so that all the objects that can be stored in the store start with an fz_storable structure. This consists of a reference count, and a function used to free the object when the reference count reaches zero. All the keep/drop functions are then reimplemented by calling fz_keep_sharable/fz_drop_sharable. The 'drop' functions as supplied by the callers are thus now 'free' functions, only called if the reference count drops to 0. The store changes to keep all the items in the store in the linked list (which becomes a doubly linked one). We still make use of the hashtable to index into this list quickly, but we now have the objects in an LRU ordering within the list. Every object is put into the store, with a size record; this is an estimate of how much memory would be freed by freeing that object. The store is moved into the context and given a maximum size; when new things are inserted into the store, care is taken to ensure that we do not expand beyond this size. We evict any stored items (that are not in use) starting from the least recently used. Finding an object in the store now takes a reference to it already. LOCK and UNLOCK comments are used to indicate where locks need to be taken and released to ensure thread safety.
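A minimal sketch of the scheme described above: a common header holding a reference count and a free function, and a size-limited store that evicts unused items from the least recently used end. The type and function names (`storable`, `store`, `keep`, `drop`, `make_room`, `store_insert`, `new_item`) are hypothetical, locking is omitted, and the hash table index and the lookup path are left out.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical header placed at the start of every storable object. */
typedef struct storable storable;
struct storable {
	int refs;
	void (*free_fn)(storable *); /* called when refs reaches zero */
	size_t size;                 /* estimate of memory freed on destruction */
	storable *prev, *next;       /* LRU list; head is most recently used */
};

typedef struct {
	storable *head, *tail;
	size_t size, max;
} store;

static int freed_count; /* lets the example observe evictions */

static void counting_free(storable *s)
{
	freed_count++;
	free(s);
}

static storable *new_item(size_t size)
{
	storable *s = calloc(1, sizeof *s);
	assert(s != NULL);
	s->refs = 1; /* the creator's reference */
	s->free_fn = counting_free;
	s->size = size;
	return s;
}

static storable *keep(storable *s)
{
	s->refs++;
	return s;
}

static void drop(storable *s)
{
	assert(s->refs > 0);
	if (--s->refs == 0)
		s->free_fn(s);
}

static void unlink_item(store *st, storable *s)
{
	if (s->prev) s->prev->next = s->next; else st->head = s->next;
	if (s->next) s->next->prev = s->prev; else st->tail = s->prev;
	s->prev = s->next = NULL;
	st->size -= s->size;
}

/* Evict items that only the store references, starting from the least
 * recently used, until `need` more bytes fit under the maximum. */
static void make_room(store *st, size_t need)
{
	storable *s = st->tail;
	while (s && st->size + need > st->max) {
		storable *prev = s->prev;
		if (s->refs == 1) { /* not in use elsewhere */
			unlink_item(st, s);
			drop(s);
		}
		s = prev;
	}
}

/* Insert at the head; the store takes its own reference. */
static void store_insert(store *st, storable *s)
{
	make_room(st, s->size);
	keep(s);
	s->prev = NULL;
	s->next = st->head;
	if (st->head) st->head->prev = s; else st->tail = s;
	st->head = s;
	st->size += s->size;
}
```

For example, with a maximum of 100, inserting items of size 60 and 30, releasing the caller's reference to the first, and then inserting an item of size 50 evicts the first item and leaves the store at size 80.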
2011-12-12	Add -i and -f flags to pdfclean.	Robin Watts
	Update pdfclean with some new decompression options; -d changes its definition (though not its behaviour) to mean "decompress all streams". New flags, -i and -f toggle the decompression of image and font streams respectively.
2011-12-09	Fix missing " in win_main.c	Robin Watts

2011-12-08	Stylistic changes when testing pointer values for NULL.	Tor Andersson
	Also: use 'cannot' instead of 'failed to' in error messages.