diff options
author | Wei Li <weili@chromium.org> | 2017-05-19 22:17:38 -0700 |
---|---|---|
committer | Chromium commit bot <commit-bot@chromium.org> | 2017-05-20 05:30:10 +0000 |
commit | 6c8ed646d1fcb8cce5a01c843c5149d989e6d5f0 (patch) | |
tree | 8561afc3d2f2e705ea6c642acd8645da00b9d096 /fpdfsdk/fpdftext.cpp | |
parent | d15ce4c1e088e8bc084b52b0acdb5f0ef6597f95 (diff) | |
download | pdfium-6c8ed646d1fcb8cce5a01c843c5149d989e6d5f0.tar.xz |
Better identify web links by trimming irrelevant charschromium/3107
Sometimes, web links are written with other text such as punctuations
which makes the extracted web links invalid. We improve this by trimming
invalid chars at the end of host name only URLs. For example, host names
never ends with ';' or ','.
BUG=chromium:720578
Change-Id: Id619025b2153531376d268a69a3a89c3d49fce08
Reviewed-on: https://pdfium-review.googlesource.com/5692
Commit-Queue: Wei Li <weili@chromium.org>
Reviewed-by: Lei Zhang <thestig@chromium.org>
Diffstat (limited to 'fpdfsdk/fpdftext.cpp')
0 files changed, 0 insertions, 0 deletions