67 Commits

Author SHA1 Message Date
David Edwards
03d08d0c9e
MNT-22082 transformation of pdf to text hang (#367)
A new constructor has been added to the TikaController to provide
the new spring config.
The creation of the TikaExecutor has been moved to a lazily created
singleton, because the @Value is injected after the TikaJavaExecutor is
instantiated, so the value was not passed correctly. The instantiation
is now done once, on the first transform request.
Param has been added to the AIO beans.
2021-04-13 09:59:42 +01:00
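The ordering problem described in the MNT-22082 message above can be sketched in plain Java. This is only an illustration under stated assumptions, not the code from the commit: `SketchExecutor`, `EagerController`, `LazyController` and `timeoutMs` are hypothetical names, and the setter stands in for whatever Spring does when it injects an `@Value` field after the object has been constructed.

```java
// Hypothetical stand-in for an executor that needs a configured value.
final class SketchExecutor {
    final long timeoutMs;

    SketchExecutor(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }
}

// Eager variant: the executor is built during construction, before any
// value has been injected, so it captures the field's default of 0.
final class EagerController {
    private long timeoutMs;
    private final SketchExecutor executor = new SketchExecutor(timeoutMs);

    // Stand-in for post-construction injection (assumption).
    void setTimeoutMs(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    SketchExecutor getExecutor() {
        return executor;
    }
}

// Lazy variant: the executor is built once, on first use, by which time
// the value has been injected.
final class LazyController {
    private long timeoutMs;
    private SketchExecutor executor;

    // Stand-in for post-construction injection (assumption).
    void setTimeoutMs(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    // synchronized so that concurrent first requests create one instance.
    synchronized SketchExecutor getExecutor() {
        if (executor == null) {
            executor = new SketchExecutor(timeoutMs);
        }
        return executor;
    }
}
```

In this sketch the eager variant keeps the default value even after the setter is called, while the lazy variant sees the injected value and returns the same instance on every later call.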
Travis CI User
79a48ac385 [maven-release-plugin][skip ci] prepare for next development iteration 2021-03-08 19:06:03 +00:00
Travis CI User
156196dc95 [maven-release-plugin][skip ci] prepare release 2.3.10 2021-03-08 19:05:58 +00:00
Travis CI User
ed7e0c76df [maven-release-plugin][skip ci] prepare for next development iteration 2021-03-08 16:27:11 +00:00
Travis CI User
2e4a4639e1 [maven-release-plugin][skip ci] prepare release 2.3.9 2021-03-08 16:27:06 +00:00
Travis CI User
adc7e291db [maven-release-plugin][skip ci] prepare for next development iteration 2021-02-18 16:07:41 +00:00
Travis CI User
310a20f049 [maven-release-plugin][skip ci] prepare release 2.3.8 2021-02-18 16:07:36 +00:00
dependabot-preview[bot]
1fbd06f6b1
Bump bcmail-jdk15on from 1.64 to 1.68 (#313)
Bumps [bcmail-jdk15on](https://github.com/bcgit/bc-java) from 1.64 to 1.68.
- [Release notes](https://github.com/bcgit/bc-java/releases)
- [Changelog](https://github.com/bcgit/bc-java/blob/master/docs/releasenotes.html)
- [Commits](https://github.com/bcgit/bc-java/commits)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>

Co-authored-by: dependabot-preview[bot] <27856297+dependabot-preview[bot]@users.noreply.github.com>
2021-02-11 18:12:35 +00:00
Travis CI User
c5d7791cb5 [maven-release-plugin][skip ci] prepare for next development iteration 2021-02-02 06:40:35 +00:00
Travis CI User
969968a70e [maven-release-plugin][skip ci] prepare release 2.3.7 2021-02-02 06:40:28 +00:00
David Edwards
ef21365e00
ACS-930 Security update to spring boot 2.4.1 (#321)
* ACS-930 Upgrade to Junit5
2021-01-15 10:31:25 +00:00
dependabot-preview[bot]
0060461695
Bump bcprov-jdk15on from 1.64 to 1.68 (#312)
Bumps [bcprov-jdk15on](https://github.com/bcgit/bc-java) from 1.64 to 1.68.
- [Release notes](https://github.com/bcgit/bc-java/releases)
- [Changelog](https://github.com/bcgit/bc-java/blob/master/docs/releasenotes.html)
- [Commits](https://github.com/bcgit/bc-java/commits)

Signed-off-by: dependabot-preview[bot] <support@dependabot.com>

Co-authored-by: dependabot-preview[bot] <27856297+dependabot-preview[bot]@users.noreply.github.com>
2021-01-14 13:15:41 +00:00
Alan Davis
2fd11d5aed
REPO-5191 Bug: T-Engine should provide mapping rather than the repo. (#316)
Bug found while reviewing documents on how to create a custom metadata extractor. The original refactor had left the repo doing the mapping. It should have been passing the fully qualified repo properties to the T-Engine to do the mapping.

Linked to:
    Alfresco/alfresco-community-repo#227
    Alfresco/acs-packaging#1826
2021-01-06 22:25:40 +00:00
Travis CI User
f0fb98f238 [maven-release-plugin][skip ci] prepare for next development iteration 2020-11-19 20:13:18 +00:00
Travis CI User
d95120fdf4 [maven-release-plugin][skip ci] prepare release 2.3.6 2020-11-19 20:13:11 +00:00
Alan Davis
00fbb6405a
ATS-829 Release T-Engines 2.3.6 (#307)
ATS-829: Release T-Core (T-Engines) 2.3.6 [trigger release]

Linked to REPO-5219 Allow AGS AMP to specify metadata extract mapping

Added an extractMapping transform option to all metadata extractors to override the default one.

Upgraded 3rd party libraries to get a green build.
* Upgrade cxf-rt-transports-http and woodstox-core to avoid issues
* Upgrade to org.springframework.boot:spring-boot-starter-parent:2.3.5.RELEASE to avoid problem in org.springframework:spring-web
* Upgrade to activemq 5.15.13 to avoid problem in activemq-broker 5.15.12
2020-11-19 18:35:22 +00:00
Travis CI User
3ef6a7a788 [maven-release-plugin][skip ci] prepare for next development iteration 2020-09-25 08:53:59 +00:00
Travis CI User
37c4f682fa [maven-release-plugin][skip ci] prepare release 2.3.5 2020-09-25 08:53:53 +00:00
Travis CI User
608fdc1ab4 [maven-release-plugin][skip ci] prepare for next development iteration 2020-08-06 15:09:04 +00:00
Travis CI User
b7c4ca02cc [maven-release-plugin][skip ci] prepare release 2.3.4 2020-08-06 15:08:57 +00:00
eknizat
0273fd5c07
ATS-816: Fix tika apple keynote (#285)
* ATS-816: Fix tika apple keynote
The application/vnd.apple.keynote -> text/plain transformation has been found to fail after switching the version of tika in ATS-801.
The previous version of tika would use the org.apache.tika.parser.pkg.PackageParser, but the new version uses an empty parser, producing an empty target file.

* Re-enable test for application/vnd.apple.keynote to text
2020-08-06 12:30:20 +01:00
Travis CI User
33eff2d8d7 [maven-release-plugin][skip ci] prepare for next development iteration 2020-08-03 10:10:09 +00:00
Travis CI User
66ff8a950c [maven-release-plugin][skip ci] prepare release 2.3.3 2020-08-03 10:10:01 +00:00
montgolfiere
c267854e12
ATS-801 (part of ACS-387) - update to core Tika 1.24.1 / Poi 4.1.2 (#269)
* ATS-801: Tika Update - part 1 (T-Core) sanity check

- initially switch to 1.21 (to see if any unit/quick tests fail in T-Core)

* ATS-801: Tika Update - part 1 (T-Core) sanity check

- 1st attempt to bump to Tika 1.24.1 / Poi 4.1.2
2020-07-03 15:46:24 +01:00
Travis CI User
4646e016d6 [maven-release-plugin][skip ci] prepare for next development iteration 2020-07-02 16:04:21 +00:00
Travis CI User
03f37d5004 [maven-release-plugin][skip ci] prepare release 2.3.2 2020-07-02 16:04:13 +00:00
Travis CI User
8cdbc00424 [maven-release-plugin][skip ci] prepare for next development iteration 2020-06-24 17:07:23 +00:00
Travis CI User
b92f6794ac [maven-release-plugin][skip ci] prepare release 2.3.1 2020-06-24 17:07:15 +00:00
Travis CI User
76457cb6e8 [maven-release-plugin][skip ci] prepare for next development iteration 2020-06-16 17:34:27 +00:00
Travis CI User
5da2a54ff1 [maven-release-plugin][skip ci] prepare release 2.3.0 2020-06-16 17:34:17 +00:00
Jan Vonka
bb939596ad ATS-779: Bump to 2.3.0-SNAPSHOT
- as per new T-Base "transformImpl" (see ATS-777 / REPO-4334)
2020-06-16 14:50:22 +01:00
Travis CI User
401fcaf2ca [maven-release-plugin][skip ci] prepare for next development iteration 2020-06-15 17:20:58 +00:00
Travis CI User
f5025483f2 [maven-release-plugin][skip ci] prepare release 2.2.3 2020-06-15 17:20:51 +00:00
Abdul Mohammed
66917c3744
Merge pull request #255 from Alfresco/fix/MNT-21487-use-full-ooxml-jar
MNT-21487: Switch from smaller schemas jar (poi-ooxml-schemas) to the full jar (ooxml-schemas)
2020-06-12 11:54:06 +01:00
Alan Davis
06109dee75
REPO-4334 Move metadata extraction into T-Engines (#247)
* Metadata extract code added to T-Engines
* Required a refactor of duplicate code to avoid 3x more duplication:
        - try catches used to return exit codes
        - calls to java libraries or commands to external processes
        - building of transform options in controllers, adaptors
* integration tests based on current extracts performed in the repo
* included extract code for libreoffice, and embed code even though not used out of the box any more. There may well be custom extracts using them that move to T-Engines
* removal of unused imports
* minor autoOrient / allowEnlargement bug fixes that were not included in Paddington on the T-Engine side.
2020-06-11 20:20:22 +01:00
Abdul Mohammed
ca768c7964 Replace poi-ooxml-schemas with ooxml-schemas 2020-06-11 15:13:18 +01:00
Ayman Harake
9931bdc678
ATS-763: Update T-Core for Legacy: Add test files & tests for newly added transforms (in ATS-731) (#252)
* ATS-763: Added missing tests in Tika

* ATS-763: Added the missing transform tests for Libre Office and replaced quick files in Tika

* ATS-763: Replaced newly added quick.xml and quick.msg with preexisting files.

* ATS-763: Added targets to tests in Libre Office - see Jan's comment in PR

* ATS-763: Added test files to Image Magick, and uncommented the PSD source file

* ATS-763: put back a comment in Image Magick how it was before my previous commit

* ATS-763: Resolved Jan's comment about separating out mimetypes into their correct section such as SPREADSHEET or PRESENTATION

* ATS-763: Fixed failing test (ppsm and ppsx)

* ATS-763: Removed unnecessary source files in Image Magick

* ATS-763: Fix failing LibreOffice unit tests

* ATS-763: Fix indentation in LibreOfficeTransformationIT

* ATS-763: fixed failing image magick tests and removed failing transform from config

* ATS-763: Added missing priority for pages -> txt

Co-authored-by: kristian <kristian.dimitrov@alfresco.com>
2020-06-09 16:57:23 +01:00
Travis CI User
dbf8568229 [maven-release-plugin][skip ci] prepare for next development iteration 2020-06-02 14:31:42 +00:00
Travis CI User
33a9e22181 [maven-release-plugin][skip ci] prepare release 2.2.2 2020-06-02 14:31:35 +00:00
Kristian Dimitrov
f6819cf8e0
ATS-731 Update T-Engines config with remaining legacy transformers
* ATS-731: Add half of the missing simple legacy transforms

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Updated transforms list to include missing legacy transforms

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added missing Transforms

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added xml, xltm, xlam, ppsx, ppsm, msg, and dita to pdf

* ATS-731: Remove deprecated workaround

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added more missing transforms. Only 6 left to do

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Merged with Kristian's changes

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added Kristian's last commit back

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Added final 6 missing transforms

* ATS-731: Remove unnecessary test configs (Tests now pull configs from jars)

* task/ATS-731_Update_T-Engines_config_with_remaining_legacy_transformers: Reverted LibreOffice file back to how it was at Kristian's last commit

* task/ATS-731_Update_T: Took back out the Outlook transforms so that they can be done by a pipeline instead

* ATS-731: Read default engine configs from jars in tests

* ATS-731: Removed failing transforms from Image magick

* ATS-731: The branch now only contains the transforms that work and have been tested. Just one more needs to be added in libre office

* Added the last of the working transforms

* ATS-731: Added one more transform

* Revert "ATS-731: Remove deprecated workaround"

This reverts commit 82de937e

* ATS-731: Enable info log level for the deprecated workaround

Co-authored-by: aharake <ayman.harake@alfresco.com>
2020-06-01 15:54:15 +01:00
Travis CI User
6b2725c77e [maven-release-plugin][skip ci] prepare for next development iteration 2020-05-01 14:59:23 +00:00
Travis CI User
65fc8d2912 [maven-release-plugin][skip ci] prepare release 2.2.1 2020-05-01 14:59:16 +00:00
Travis CI User
a8b9a42ce7 [maven-release-plugin][skip ci] prepare for next development iteration 2020-04-24 13:06:08 +00:00
Travis CI User
2b764c787d [maven-release-plugin][skip ci] prepare release 2.2.0 2020-04-24 13:06:01 +00:00
David Edwards
f503b863db ATS-708 Update pom versions to 2.2.0-SNAPSHOT 2020-04-24 12:30:28 +01:00
Travis CI User
81d691dfce [maven-release-plugin][skip ci] prepare for next development iteration 2020-04-23 17:16:58 +00:00
Travis CI User
1ddc63dc55 [maven-release-plugin][skip ci] prepare release 2.2.0-A5 2020-04-23 17:16:51 +00:00
David Edwards
0eda874c82
ATS-702 Add AIO tests from Misc Transformers (#234) 2020-04-23 12:35:27 +01:00
David Edwards
bcb6626965 Revert "ATS-702 Add AIO tests from Misc Transformers (#230)"
This reverts commit b69a17a2a3d7c76ef3344c2fee0bf6c624fae9fb.
2020-04-23 11:23:31 +01:00
David Edwards
b69a17a2a3
ATS-702 Add AIO tests from Misc Transformers (#230)
Add Misc transforms AIO tests
Add Misc IT through AIO
Remove accidental commit.

Co-authored-by: Erik Knizat <erik.knizat@alfresco.com>
Co-authored-by: kristian <kristian.dimitrov@alfresco.com>
Co-authored-by: eknizat <26163420+eknizat@users.noreply.github.com>
2020-04-23 11:05:40 +01:00