Recent Changes¶
This Version¶
- Added new option flag
--interleave
for tdda detect. This causes the_ok
detection fields to be interleaved with the original fields that they refer to in the resulting detection dataset, rather than all appearing together at the far right hand side. This option was actually present in the previous release, but not sufficiently documented. - Fix for the
--write-all
parameter fortdda.referencetest
result regeneration, which had regressed slightly in the previous version.
Older Versions¶
- Improved reporting of differences for text files in
tdda.referencetest
when the actual results do not match the expected file contents. Now fully takes account of theignore
andremove
parameters. - The
ignore_patterns
parameter inassertTextFileCorrect()
(and others) intdda.referencetest
now causes only the portion of a line that matches the regular expressions to be ignored; anything else on the line (before or after the part that matches a regular expression) must be identical in the actual and expected results. This means that you are specifying the part of the line that is allowed to differ, rather than marking an entire line to be ignored. This is a change in functionality, but is what had always been intended. For fuller control (and to get the previous behaviour), you can anchor the expressions with^.*(...).*$
, and then they will apply to the entire line. - The
ignore_patterns
parameter intdda.referencetest
can now accept grouped subexpressions in regular expressions. This allows use of alternations, which were previously not supported. - The
ignore_substrings
parameter inassertTextFileCorrect()
(and others)tdda.referencetest
now only matches lines in the expected file (where you have full control over what will appear there), not in the actual file. This fixes a problem with differences being masked (and not reported as problems) if the actual happened to include unexpected matching content on lines other than where intended. - The
tdda.constraints
package is now more resilient against unexpected type mismatches. Previously, if the type didn’t match, then in some circumstances exceptions would be (incorrectly) raised for other constraints, rather than failures. - The
tdda.constraints
package now supports Pythondatetime.date
fields in Pandas DataFrames, in addition to the existing support ofdatetime.datetime
. - The
tdda.constraints
Python API now provides support for in-memory constraints, by allowing Python dictionaries to be passed in toverify_df()
anddetect_df()
, as an alternative to passing in a.tdda
filename. This allows an application using the library to store its constraints however it wants to, rather than having to use the filesystem (e.g. storing it online and fetching with an HTTPGET
). - The
tdda.constraints
package can now access MySQL databases using the mysql.connector driver, in addition to the MySQLdb and mysqlclient drivers. - The
tdda.rexpy
tool can now quote the regular expressions it produces, with the new--quote
option flag. This makes it easier to copy the expressions to use them on the command line, or embed them in strings in many programming languages. - The Python API now allows you to
import tdda
and then refer to its subpackages viatdda.referencetest
,tdda.constraints
ortdda.rexpy
. Previously you had to explicitly import each submodule separately.