A new constructor has been added to the TikaController to provide
the new spring config.
The creation of the TikaExecutor has been moved to "singleton pattern" as
the injection of the @Value happens after the instantiation of the
TikaJavaExecutor and does not pass the value correctly. The
instantiation is now done once, on the first transform request.
Param has been added to the AIO beans.
* ATS-863: Update to latest Java Base Image on CentOS 8 (& Open JDK 11.0.10)
- to resolve 2 medium (at time of writing) in quay.io
- update to CentOS 7 to 8 (aligned with Repo decision)
* ATS-863: Update to latest Java Base Image on CentOS 8 (& Open JDK 11.0.10)
- to resolve 2 medium (at time of writing) in quay.io
- update to CentOS 7 to 8 (aligned with Repo decision)
- also update ImageMagick from 7.0.10-11 to 7.0.10-59 - see also ACS-1174
* Introduce more flexible reading of UTF-16 data, where there may be a BOM, but the
spec says there should not be one, or the BOM is clearly wrong when looking at the
following characters. The https://en.wikipedia.org/wiki/UTF-16 write up is nice and clear.
* Includes identical correction in data setup in AIOTransformRegistryTest and
MicsControllerTest for a problem found in TextToPdfContentTransformerTest.
* Includes upgrade to latest pdfbox: 2.0.22
Bug found while reviewing documents on how to create a custom metadata extractor. The original refactor had left the repo doing the mapping. It should have been passing the fully qualified repo properties to the T-Engine to do the mapping.
Linked to:
Alfresco/alfresco-community-repo#227Alfresco/acs-packaging#1826
ATS-829: Release T-Core (T-Engines) 2.3.6 [trigger release]
Linked to REPO-5219 Allow AGS AMP to specify metadata extract mapping
Added an extractMapping transform option to all metadata extractors to override the default one.
3rd party libraries to get a green build.
* Upgrade cxf-rt-transports-http and woodstox-core to avoid issues
* Upgrade to org.springframework.boot:spring-boot-starter-parent:2.3.5.RELEASE to avoid problem in org.springframework:spring-web
* Upgrade to activemq 5.15.13 to avoid problem in activemq-broker 5.15.12
* MNT-21869 libreoffice timeout set too high
Reduce default value of timeout for libreoffice from 2000min to 20min
Add option to configure libreoffice timeout externally.
Enable to configure externally the port on which the app starts.
Add external-engine-configuration.md
* Metadata extract code added to T-Engines
* Required a refactor of duplicate code to avoid 3x more duplication:
- try catches used to return return exit codes
- calls to java libraries or commands to external processes
- building of transform options in controllers, adaptors
* integration tests based on current extracts performed in the repo
* included extract code for libreoffice, and embed code even though not used out of the box any more. There may well be custom extracts using them that move to T-Engines
* removal of unused imports
* minor autoOrient / allowEnlargement bug fixes that were not included in Paddington on the T-Engine side.
* ATS-731: Add half of the missing simple legacy transforms
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Updated transforms list to include missing legacy transforms
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added missing Transforms
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added xml xltm, xlam, ppsx, ppsm, msg, and dita to pdf
* ATS-731: Remove depreciated workaround
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added more missing transforms. Only 6 left to do
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Merged with Kristian's changes
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added Kristians' last commit back
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added final 6 missing transforms
* ATS-731: Remove unnecessary test configs (Tests now pull configs from jars)
* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Reverted libra office file back to how it was at Kristian's last commit
* task/ATS-731_Update_T: Took back out the outlook transforms so that they can be done by a pipeline inestead
* ATS-731: Read default engine configs from jars in tests
* ATS-731: Removed failing transforms from Image magick
* ATS-731: The branch now only contains the transforms that work and have been tested. Just one more needs to be added in libre office
* Added the last of the working transfroms
* ATS-731: Added one more transform
* Revert "ATS-731: Remove depreciated workaround"
This reverts commit 82de937e
* ATS-731: Enable info log level for the depreciated workaround
Co-authored-by: aharake <ayman.harake@alfresco.com>
* REPO-5188 SourceEncoding should not be used to select transforms as it is always provided to T-Engines
* Fix test. Now that SourceEncoding is in the options but is not used to select the transformer, we need to add another
option to force the use of the textToPdf transform rather than libreoffice.
* ATS-735: Update to ImageMagick 7.0.10-11 distro bins
* ATS-735: Revert back to previous ImageMagick version 7.0.7-27 to verify whether the latest release is not compatible with ATS
* ATS-735: Found two more references for ImageMagick 7.0.7 in the application-default.yaml for ImageMagick and AIO Transformer
* Changed systemPropertyVaraibles to point to latest ImageMagick bins
* Added new RPM repo for CentOS 7.5.x in order to server new dependencies for ImageMagick 7.0.10-11