mupdf - MuPDF PDF reader and library

Age	Commit message (Collapse)	Author
2014-01-10	Fix build_filter_chain not to leak if pdf_array_get fails.	Robin Watts
	In the existing code, if build_filter fails, chain will be freed. If pdf_array_get fails however, it will leak. Rectify this. No specific bug or example file, just observation arising from discussions about previous commit.
2014-01-09	prevent two further heap access violations	Simon Bünzli
	pdf_open_raw_renumbered_stream and pdf_open_image_stream both have the same issue that 98a111c8e49916f8f5ac21d11f4627540f9ddd49 fixes.
2014-01-09	Bug 694878: Fix SEGV due to double free	Robin Watts
	When constructing a filter chain, we pass ownership of 'chain' inwards. This means we need to be careful not to double close chain. This fixes: 5df97f8539d31745f1c45cc9e1468825_asan_heap-oob_a59afe_1862_225.pdf a736faf6f4a34b7ad8eff207ba52aa57_asan_heap-oob_a59dd9_5744_4860.pdf Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the fuzzing files.
2014-01-09	Add -o option for mutool show.	Tor Andersson
	Windows doesn't like redirecting binary output, so add an explicit filename argument.
2014-01-08	fuzzing fix for null colorspace derefence.	Robin Watts
	Bad annotation appearance streams can cause font_recs to have invalid values in. Avoid this partly by hardening the code against duff values, and partly by setting sane defaults before the parsing. This can be seen in: 33bfbe117bfef7fafc3f927acf50a2e7_signal_sigsegv_81dd96_6257_5205.pdf Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the example files.
2014-01-08	Fix fuzzing bug due to float representation limitations.	Robin Watts
	The gel bbox was being stored internally as floats (despite only holding ints). This means that as numbers get large the bbox can become approximate, rather than exact. If the bbox becomes smaller than it should, this causes crashes in the scanline filling code. This is seen with: tests_private/fuzzing/mupdf2/17f8aee51ac776994af0b36195cdadd7_signal_sigsegv_5607be_7308_5912.pdf The solution is simply to use ints rather than floats. Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the example files.
2014-01-08	Fuzzing fix: Overrun in fz_predict_png	Robin Watts
	If a file specifies a silly number of bpp in the PNG predictor it can overrun a buffer. This was shown by: tests_private/fuzzing/mupdf2/013b2dcbd0207501e922910ac335eb59_*.pdf but no longer shows up due to Simons earlier fix. Following discussion we still think it's worth having this fix in, as truncated data streams can cause len < bpp. Possibly we should throw an error here, but I think that's not necessary as we will return the short length, and the image reading code will notice that the image is truncated already. Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the fuzzing files.
2014-01-08	prevent heap access violation in pdf_cache_object	Simon Bünzli
	pdf_load_obj_stm may resize the xref if it finds further objects in the stream, that might however invalidate any pdf_xref_entry hold such as the one in pdf_cache_object. This can be seen e.g. with 7ac3ad9ddad98d10b947a43cf640062f_asan_heap-uaf_930b78_1007_1675.pdf Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the example files.
2014-01-08	sanitize crypt revision in pdf_new_crypt	Simon Bünzli
	(Second part of Simons patch - apologies for missing this the first time). This correctly enables the sanitization of the key length needed for 90db34f64037e2a8a5c3b6a518ba4153_asan_heap-oob_9b117e_1197_1802.pdf Thanks to Mateusz Jurczyk and Gynvael Coldwind of the Google Security Team for providing the example files.
2014-01-08	sanitize number of columns in fz_open_faxd	Simon Bünzli
	If columns is quite close to INT_MAX, the column index max overflow in find_changing which causes an access violation in the next getbits. This happens e.g. with 0c76a20163f30ea8ec860c4e588ce337_signal_sigsegv_5e7b28_9115_7127.pdf
2014-01-08	sanitize crypt revision in pdf_new_crypt	Simon Bünzli
	This correctly enables the sanitization of the key length needed for 90db34f64037e2a8a5c3b6a518ba4153_asan_heap-oob_9b117e_1197_1802.pdf
2014-01-08	sanitize values in fz_open_predict	Simon Bünzli
	This fixes a NULL pointer dereference in 2192b04848b2d8210d1a33e3ddeb2742_asan_heap-oob_a5a57d_2745_2844.pdf Also, replace MAXC with FZ_MAX_COLORS.
2014-01-07	Introduce 'document handlers'.	Robin Watts
	We define a document handler for each file type (2 in the case of PDF, one to handle files with the ability to 'run' them, and one without). We then register these handlers with the context at startup, and then call fz_open_document... as usual. This enables people to select the document types they want at will (and even to extend the library with more document types should they wish).
2014-01-06	Bug 694869: Fix indetermisms with broken PNG files.	Robin Watts
	This bug shows 2 problems with our data handling. Firstly, if a zip file entry has less data in the stream than it is declared to have, we would leave the end of the data uninitialised. We now put out a warning, and blank it with zeros. Secondly, if the PNG decompression fails to decode enough data, we don't notice. Now we give a warning and blank the remaining pixels.
2014-01-06	reuse JBIG2Globals	Simon Bünzli
	Certain optimized documents use a rather large common symbol dictionary for all JBIG2 images. Caching these JBIG2Globals speeds up loading and rendering of such documents.
2014-01-06	show jbig2dec warnings/errors in stderr	Simon Bünzli
	This helps debugging issues with JBIG2 images. Conflicts: source/fitz/filter-jbig2.c
2014-01-06	add stub files for JPEG-XR support	Simon Bünzli
	See SumatraPDF's repo for a Windows-only implementation using WIC.
2014-01-06	tolerate slightly broken page trees	Simon Bünzli
	At https://code.google.com/p/sumatrapdf/issues/detail?id=2460 , there's a file with missing /Type keys in the page tree nodes. In that case, leaf nodes and intermediary nodes have to be distinguished in a different way.
2014-01-06	fix MSVC warnings C4054 and C4152	Simon Bünzli
	These warnings are caused by casting function pointers to void* instead of proper function types.
2014-01-06	fix various MSVC warnings	Simon Bünzli
	Some warnings we'd like to enable for MuPDF and still be able to compile it with warnings as errors using MSVC (2008 to 2013): * C4115: 'timeval' : named type definition in parentheses * C4204: nonstandard extension used : non-constant aggregate initializer * C4295: 'hex' : array is too small to include a terminating null character * C4389: '==' : signed/unsigned mismatch * C4702: unreachable code * C4706: assignment within conditional expression Also, globally disable C4701 which is frequently caused by MSVC not being able to correctly figure out fz_try/fz_catch code flow. And don't define isnan for VS2013 and later where that's no longer needed.
2014-01-02	Add rebinding for fz_devices and fz_documents	Robin Watts
	The SVG device needs rebinding as it holds a file. The PDF device needs to rebind the underlying pdf document. All documents need to rebind their underlying streams.
2014-01-02	Add rebinding for fz_streams.	Robin Watts

2014-01-02	Add rebinding for fz_output.	Robin Watts

2014-01-02	Bug 694585: Further improve mesh rendering times	Robin Watts
	Add a cached color converter mechanism. Use this for rendering meshes to speed repeated conversions. This reduces a (release build to ppm at default resolution) run from 23.5s to 13.2 seconds.
2014-01-02	Bug 694585: Slow rendering of meshes	Robin Watts
	In the existing code for meshes, we decompose the mesh down into quads (or triangles) and then call a process routine to actually do the work. This process routine typically maps each vertexes position/color and plots it. As each vertex is used several times by neighbouring patches, this results in each vertex being processed several times. The fix in this commit is therefore to break the processing into 'prepare' and 'process' phases. Each vertex is 'prepared' before being used in the 'process' phase. This cuts the number of prepare operations in half. In testing, this reduced the time for a (release build, generating ppm at default resolution) run from 33.4s to 23.5s.
2014-01-02	Improve PDF repair logic.	Robin Watts
	When we meet a broken PDF file, we attempt to repair it. We do this by reading tokens from the file and attempting to interpret them as a normal PDF stream. Unfortunately, if the file is corrupt enough so that we start to read from the middle of a stream, and we happen to hit an '(' character, we can go into string reading mode. We can then end up skipping over vast swathes of file that we could otherwise repair. We fix this here by using a new version of the pdf_lex function that refuses to ever return a string. This means we may take more time over skipping things than we did before, but are less likely to skip stuff. We also tweak other parts of the pdf repair logic here. If we hit a badly formed piece of data, clear the num/gen we have stored so that the next plausible piece we get does not get assigned to a random object number.
2014-01-02	Cull code unused as a result of the "tolerate inline images..." fix.	Robin Watts
	Remove code that's not used any more as a result of the previous fix, plus some code that was unused anyway.
2014-01-02	fix memory leak in pdf_repair_xref	Simon Bünzli
	The 0 null object is leaked if a document refers to 0 0 obj before requiring a delayed reparation (seen e.g. with 3324.pdf.asan.3.2585).
2014-01-02	Fix memory leak in pdf_xref_size_from_old_trailer.	Robin Watts
	Thanks to Simon for spotting the original problem. This is a slight tweak on the patch he supplied.
2014-01-02	Bug 694569: suspicious for loop in pdf-device.c	Paul Gardiner
	Replace an explicit i = i by a comment in a for loop where i is already at the correct starting value.
2014-01-02	Bug 694729: Rounding the Ink	Paul Gardiner
	Use round caps and joins so as to better match the result of drawing, and also so that single dots display. Thanks to Michael Cadilhac for the suggestion.
2014-01-01	Bug 693320: Avoid unaligned accesses in SHA routines.	Robin Watts
	Avoid unnecessary copies. Minimise calls to isbigendian.
2014-01-01	tolerate inline images without EOF markers	Simon Bünzli
	This is required for e.g. 1980_-_compressed_inline_image.pdf and Bug690300.pdf .
2014-01-01	don't fail on invalid object streams	Simon Bünzli
	At https://code.google.com/p/sumatrapdf/issues/detail?id=2436 , there's a document with an empty xref section which since recently causes a repair to be triggered. Repairs then stop when pdf_repair_obj_stms fails on an object which isn't even required for the document to render. Such broken object streams should rather be ignored same as broken objects are ignored in pdf_init_document.
2013-12-30	Bug 694564: Move pdf_lookup_page_obj to be iterative	Robin Watts
	Avoid recursion to avoid stack overflows.
2013-12-24	Bug 694587: Fix pattern repeat calculation	Robin Watts
	The pattern repeat calculation should be done in pattern space, but one of the arguments in the calculation was being taken from device space. Fix this. Also only apply the bias in the case where the bias would make it larger. 173 progressions.
2013-12-24	Bug 694810: Implement late file repair for PDFs.	Robin Watts
	Currently, if we spot a bad xref as we are reading a PDF in, we can repair that PDF by doing a long exhaustive read of the file. This reconstructs the information that was in the xref, and the file can be opened (and later saved) as normal. If we hit an object that is not in the expected place however, we cannot trigger a repair at that point - so xrefs with duff offsets in (within the bounds of the file) will never be repaired. This commit solves that by triggering a repair (just once) whenever we fail to parse an object in the expected place.
2013-12-23	Bug 694712: Do not create empty Contents stream for new pages.	Robin Watts
	Empty Contents streams are not valid - they need a length at least. The alternative approach would be to put /Length 0 and update it later.
2013-12-23	Bug 694715: Fix typo in error message	Robin Watts
	Thanks to Michael Cadilhac for spotting this.
2013-12-23	Bug 694749: Fix transformation of hinted glyphs	Robin Watts
	Simple typo. Thanks to Alexander Monakov for spotting this.
2013-12-23	Bug 694794: Fix blendmode use in pdf device.	Robin Watts
	Previously we were setting blendmode in the created form XObjects transparency group definition. This didn't work as PDF readers don't look for it there. Now we set it in the calling stream's resources, and set it before calling the group.
2013-12-23	Bug 694770: Fix typo in error message.	Robin Watts
	Thanks to Makoto Fujiwara for spotting this.
2013-12-19	Solve subpixel rendering problems with 270 degree rotations	Robin Watts
	It seems that (int)-98.5 = 98, not -99. Use floorf instead.
2013-12-17	Bug 694810: Fix crash when clicking on mupdf window after failed page load.	Robin Watts
	If we have a NULL page, don't attempt to pass events to it.
2013-12-17	Remove fz_context from pdf_crypt	Robin Watts
	Unused field. Also tweak some comments for clarity.
2013-12-16	javascript: fix missing type destruction	Paul Gardiner

2013-12-16	JavaScriptCore-based implementation of the MuPDF JS engine API	Paul Gardiner

2013-11-28	Bug 694127: Valgrind fix for pdf_decode_cmap	Robin Watts
	A poorly formed string can cause us to overrun the end of the buffer. Now we check the end of the string at each stage to avoid this.
2013-11-28	Bug 694118: Fix valgrind warning caused by overflowing function	Robin Watts
	We were miscalculating the offsets into a sampled functions table, causing us to overrun the end. Fixed here.
2013-11-27	fix regression from da277059b37380d57028ff79a636f4d725c96e8f	Simon Bünzli
	The changes to fz_render_glyph cause the scissor rectangle to no longer match the transformation matrix which causes Type 3 glyphs to be clipped at larger resolutions.