Blame NEWS

Packit 504f36
News about PCRE2 releases
Packit 504f36
-------------------------
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.32 10-September-2018
Packit 504f36
-------------------------------
Packit 504f36
Packit 504f36
This is another mainly bugfix and tidying release with a few minor
Packit 504f36
enhancements. These are the main ones:
Packit 504f36
Packit 504f36
1. pcre2grep now supports the inclusion of binary zeros in patterns that are
Packit 504f36
read from files via the -f option.
Packit 504f36
Packit 504f36
2. ./configure now supports --enable-jit=auto, which automatically enables JIT
Packit 504f36
if the hardware supports it.
Packit 504f36
Packit 504f36
3. In pcre2_dfa_match(), internal recursive calls no longer use the stack for
Packit 504f36
local workspace and local ovectors. Instead, an initial block of stack is
Packit 504f36
reserved, but if this is insufficient, heap memory is used. The heap limit
Packit 504f36
parameter now applies to pcre2_dfa_match().
Packit 504f36
Packit 504f36
4. Updated to Unicode version 11.0.0.
Packit 504f36
Packit 504f36
5. (*ACCEPT:ARG), (*FAIL:ARG), and (*COMMIT:ARG) are now supported.
Packit 504f36
Packit 504f36
6. Added support for \N{U+dddd}, but only in Unicode mode.
Packit 504f36
Packit 504f36
7. Added support for (?^) to unset all imnsx options.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.31 12-February-2018
Packit 504f36
------------------------------
Packit 504f36
Packit 504f36
This is mainly a bugfix and tidying release (see ChangeLog for full details).
Packit 504f36
However, there are some minor enhancements.
Packit 504f36
Packit 504f36
1. New pcre2_config() options: PCRE2_CONFIG_NEVER_BACKSLASH_C and
Packit 504f36
PCRE2_CONFIG_COMPILED_WIDTHS.
Packit 504f36
Packit 504f36
2. New pcre2_pattern_info() option PCRE2_INFO_EXTRAOPTIONS to retrieve the
Packit 504f36
extra compile time options.
Packit 504f36
Packit 504f36
3. There are now public names for all the pcre2_compile() error numbers.
Packit 504f36
Packit 504f36
4. Added PCRE2_CALLOUT_STARTMATCH and PCRE2_CALLOUT_BACKTRACK bits to a new
Packit 504f36
field callout_flags in callout blocks.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.30 14-August-2017
Packit 504f36
----------------------------
Packit 504f36
Packit 504f36
The full list of changes that includes bugfixes and tidies is, as always, in
Packit 504f36
ChangeLog. These are the most important new features:
Packit 504f36
Packit 504f36
1. The main interpreter, pcre2_match(), has been refactored into a new version
Packit 504f36
that does not use recursive function calls (and therefore the system stack) for
Packit 504f36
remembering backtracking positions. This makes --disable-stack-for-recursion a
Packit 504f36
NOOP. The new implementation allows backtracking into recursive group calls in
Packit 504f36
patterns, making it more compatible with Perl, and also fixes some other
Packit 504f36
previously hard-to-do issues. For patterns that have a lot of backtracking, the
Packit 504f36
heap is now used, and there is an explicit limit on the amount, settable by
Packit 504f36
pcre2_set_heap_limit() or (*LIMIT_HEAP=xxx). The "recursion limit" is retained,
Packit 504f36
but is renamed as "depth limit" (though the old names remain for
Packit 504f36
compatibility).
Packit 504f36
Packit 504f36
There is also a change in the way callouts from pcre2_match() are handled. The
Packit 504f36
offset_vector field in the callout block is no longer a pointer to the
Packit 504f36
actual ovector that was passed to the matching function in the match data
Packit 504f36
block. Instead it points to an internal ovector of a size large enough to hold
Packit 504f36
all possible captured substrings in the pattern.
Packit 504f36
Packit 504f36
2. The new option PCRE2_ENDANCHORED insists that a pattern match must end at
Packit 504f36
the end of the subject.
Packit 504f36
Packit 504f36
3. The new option PCRE2_EXTENDED_MORE implements Perl's /xx feature, and
Packit 504f36
pcre2test is upgraded to support it. Setting within the pattern by (?xx) is
Packit 504f36
also supported.
Packit 504f36
Packit 504f36
4. (?n) can be used to set PCRE2_NO_AUTO_CAPTURE, because Perl now has this.
Packit 504f36
Packit 504f36
5. Additional compile options in the compile context are now available, and the
Packit 504f36
first two are: PCRE2_EXTRA_ALLOW_SURROGATE_ESCAPES and
Packit 504f36
PCRE2_EXTRA_BAD_ESCAPE_IS_LITERAL.
Packit 504f36
Packit 504f36
6. The newline type PCRE2_NEWLINE_NUL is now available.
Packit 504f36
Packit 504f36
7. The match limit value now also applies to pcre2_dfa_match() as there are
Packit 504f36
patterns that can use up a lot of resources without necessarily recursing very
Packit 504f36
deeply.
Packit 504f36
Packit 504f36
8. The option REG_PEND (a GNU extension) is now available for the POSIX
Packit 504f36
wrapper. Also there is a new option PCRE2_LITERAL which is used to support
Packit 504f36
REG_NOSPEC.
Packit 504f36
Packit 504f36
9. PCRE2_EXTRA_MATCH_LINE and PCRE2_EXTRA_MATCH_WORD are implemented for the
Packit 504f36
benefit of pcre2grep, and pcre2grep's -F, -w, and -x options are re-implemented
Packit 504f36
using PCRE2_LITERAL, PCRE2_EXTRA_MATCH_WORD, and PCRE2_EXTRA_MATCH_LINE. This
Packit 504f36
is tidier and also fixes some bugs.
Packit 504f36
Packit 504f36
10. The Unicode tables are upgraded from Unicode 8.0.0 to Unicode 10.0.0.
Packit 504f36
Packit 504f36
11. There are some experimental functions for converting foreign patterns
Packit 504f36
(globs and POSIX patterns) into PCRE2 patterns.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.23 14-February-2017
Packit 504f36
------------------------------
Packit 504f36
Packit 504f36
1. ChangeLog has the details of a lot of bug fixes and tidies.
Packit 504f36
Packit 504f36
2. There has been a major re-factoring of the pcre2_compile.c file. Most syntax
Packit 504f36
checking is now done in the pre-pass that identifies capturing groups. This has
Packit 504f36
reduced the amount of duplication and made the code tidier. While doing this,
Packit 504f36
some minor bugs and Perl incompatibilities were fixed (see ChangeLog for
Packit 504f36
details.)
Packit 504f36
Packit 504f36
3. Back references are now permitted in lookbehind assertions when there are
Packit 504f36
no duplicated group numbers (that is, (?| has not been used), and, if the
Packit 504f36
reference is by name, there is only one group of that name. The referenced
Packit 504f36
group must, of course be of fixed length.
Packit 504f36
Packit 504f36
4. \g{+<number>} (e.g. \g{+2} ) is now supported. It is a "forward back
Packit 504f36
reference" and can be useful in repetitions (compare \g{-<number>} ). Perl does
Packit 504f36
not recognize this syntax.
Packit 504f36
Packit 504f36
5. pcre2grep now automatically expands its buffer up to a maximum set by
Packit 504f36
--max-buffer-size.
Packit 504f36
Packit 504f36
6. The -t option (grand total) has been added to pcre2grep.
Packit 504f36
Packit 504f36
7. A new function called pcre2_code_copy_with_tables() exists to copy a
Packit 504f36
compiled pattern along with a private copy of the character tables that is
Packit 504f36
uses.
Packit 504f36
Packit 504f36
8. A user supplied a number of patches to upgrade pcre2grep under Windows and
Packit 504f36
tidy the code.
Packit 504f36
Packit 504f36
9. Several updates have been made to pcre2test and test scripts (see
Packit 504f36
ChangeLog).
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.22 29-July-2016
Packit 504f36
--------------------------
Packit 504f36
Packit 504f36
1. ChangeLog has the details of a number of bug fixes.
Packit 504f36
Packit 504f36
2. The POSIX wrapper function regcomp() did not used to support back references
Packit 504f36
and subroutine calls if called with the REG_NOSUB option. It now does.
Packit 504f36
Packit 504f36
3. A new function, pcre2_code_copy(), is added, to make a copy of a compiled
Packit 504f36
pattern.
Packit 504f36
Packit 504f36
4. Support for string callouts is added to pcre2grep.
Packit 504f36
Packit 504f36
5. Added the PCRE2_NO_JIT option to pcre2_match().
Packit 504f36
Packit 504f36
6. The pcre2_get_error_message() function now returns with a negative error
Packit 504f36
code if the error number it is given is unknown.
Packit 504f36
Packit 504f36
7. Several updates have been made to pcre2test and test scripts (see
Packit 504f36
ChangeLog).
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.21 12-January-2016
Packit 504f36
-----------------------------
Packit 504f36
Packit 504f36
1. Many bugs have been fixed. A large number of them were provoked only by very
Packit 504f36
strange pattern input, and were discovered by fuzzers. Some others were
Packit 504f36
discovered by code auditing. See ChangeLog for details.
Packit 504f36
Packit 504f36
2. The Unicode tables have been updated to Unicode version 8.0.0.
Packit 504f36
Packit 504f36
3. For Perl compatibility in EBCDIC environments, ranges such as a-z in a
Packit 504f36
class, where both values are literal letters in the same case, omit the
Packit 504f36
non-letter EBCDIC code points within the range.
Packit 504f36
Packit 504f36
4. There have been a number of enhancements to the pcre2_substitute() function,
Packit 504f36
giving more flexibility to replacement facilities. It is now also possible to
Packit 504f36
cause the function to return the needed buffer size if the one given is too
Packit 504f36
small.
Packit 504f36
Packit 504f36
5. The PCRE2_ALT_VERBNAMES option causes the "name" parts of special verbs such
Packit 504f36
as (*THEN:name) to be processed for backslashes and to take note of
Packit 504f36
PCRE2_EXTENDED.
Packit 504f36
Packit 504f36
6. PCRE2_INFO_HASBACKSLASHC makes it possible for a client to find out if a
Packit 504f36
pattern uses \C, and --never-backslash-C makes it possible to compile a version
Packit 504f36
PCRE2 in which the use of \C is always forbidden.
Packit 504f36
Packit 504f36
7. A limit to the length of pattern that can be handled can now be set by
Packit 504f36
calling pcre2_set_max_pattern_length().
Packit 504f36
Packit 504f36
8. When matching an unanchored pattern, a match can be required to begin within
Packit 504f36
a given number of code units after the start of the subject by calling
Packit 504f36
pcre2_set_offset_limit().
Packit 504f36
Packit 504f36
9. The pcre2test program has been extended to test new facilities, and it can
Packit 504f36
now run the tests when LF on its own is not a valid newline sequence.
Packit 504f36
Packit 504f36
10. The RunTest script has also been updated to enable more tests to be run.
Packit 504f36
Packit 504f36
11. There have been some minor performance enhancements.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.20 30-June-2015
Packit 504f36
--------------------------
Packit 504f36
Packit 504f36
1. Callouts with string arguments and the pcre2_callout_enumerate() function
Packit 504f36
have been implemented.
Packit 504f36
Packit 504f36
2. The PCRE2_NEVER_BACKSLASH_C option, which locks out the use of \C, is added.
Packit 504f36
Packit 504f36
3. The PCRE2_ALT_CIRCUMFLEX option lets ^ match after a newline at the end of a
Packit 504f36
subject in multiline mode.
Packit 504f36
Packit 504f36
4. The way named subpatterns are handled has been refactored. The previous
Packit 504f36
approach had several bugs.
Packit 504f36
Packit 504f36
5. The handling of \c in EBCDIC environments has been changed to conform to the
Packit 504f36
perlebcdic document. This is an incompatible change.
Packit 504f36
Packit 504f36
6. Bugs have been mended, many of them discovered by fuzzers.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.10 06-March-2015
Packit 504f36
---------------------------
Packit 504f36
Packit 504f36
1. Serialization and de-serialization functions have been added to the API,
Packit 504f36
making it possible to save and restore sets of compiled patterns, though
Packit 504f36
restoration must be done in the same environment that was used for compilation.
Packit 504f36
Packit 504f36
2. The (*NO_JIT) feature has been added; this makes it possible for a pattern
Packit 504f36
creator to specify that JIT is not to be used.
Packit 504f36
Packit 504f36
3. A number of bugs have been fixed. In particular, bugs that caused building
Packit 504f36
on Windows using CMake to fail have been mended.
Packit 504f36
Packit 504f36
Packit 504f36
Version 10.00 05-January-2015
Packit 504f36
-----------------------------
Packit 504f36
Packit 504f36
Version 10.00 is the first release of PCRE2, a revised API for the PCRE
Packit 504f36
library. Changes prior to 10.00 are logged in the ChangeLog file for the old
Packit 504f36
API, up to item 20 for release 8.36. New programs are recommended to use the
Packit 504f36
new library. Programs that use the original (PCRE1) API will need changing
Packit 504f36
before linking with the new library.
Packit 504f36
Packit 504f36
****