- add embedded testArchiveToPdf & ensure ArchiveContentTransformerTest is added to suite (wasn't being run before ?)
- minor refactor of related tests to ensure "Transformers" subsystem is started before checking content transformer registry
- add TEMP workaround to attempt to (re-)start LibreOffice/OpenOffice when running in local env
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@136802 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
* Support the creation of thumbnails for 2008/9 and 2013/14 iWorks file types.
* Support the preview of 2008/9 and 2013/14 iWorks file types. It should be noted that the
embedded PDF files from 2008/9 no longer are used for the preview, so the quality is not
as good as it was in the past. It is however using the embedded JPEG and quality is reasonable.
* AppleIWorksContentTransformer no longer supports PDF as an target mimetype, as the newer
iWoks 2013/14 formats no longer includes an embedded PDF file. Having PDF supported by the
transformer resulted in Share always trying PDF which was one of the reasons that the newer
formats did not preview. The other reason was that the embedded JPEG filename also changed.
* Added 'quick' files for testing.
* Also modified the exception message thrown when the iWorks file did not contain an embedded
preview file, which is optional. An example would is "The source numbers file did not contain
a jpg preview", rather than "Unable to transform numbers file to jpg". This should reduce the
number of issues raised with support.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@135869 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
<< Root cause for these 4 issues was a fix made in 4.2.4 MNT-11279. That fix was trying to
use the file name field value in a document when transformed to PDF. In addition to the
3 problems found, using file names supplied by users is a potential security issue.
We have not found a way to exploit the use of the original file name, but that should
not stop us from closing off the possibility. Fix is being merged to the dot N branches
of 5.2, 5.1, 5.0 and 4.2 so the fix will be in the next service pack for each branch.
The original MNT-16609 fix is also being reverted as the original code included the source
extension in the temporary file name. >>>
Reverse merged 5.2.N (5.2.1)
129407 skopf: MNT-16609 - Alfresco OOTB install doesn't show preview for .EML files on Windows
94626 adavis: Merged HEAD-BUG-FIX (5.1/Cloud) to HEAD (5.1/Cloud)
89429: ACE-2649: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
89386: Merged DEV to 4.2-BUG-FIX (4.2.4)
85587: MNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@133715 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Addition of a while list of Tika derived mimetypes and declared mimetypes that are deemed to be safe (e.g. for .ai and .pdf).
- Based on Andrei's patch (see CR-546), but with the strictMimetypeCheck configuration and logic moved to TransformerConfig
- Rework of DifferrentMimeTypeTest (was not in the patch) as it had become hard to understand.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@130139 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Ensure error when indexer is the caller includes the file name rather than null.
NodeContentGet now calls transform method with TransformationOptions parameter and the original
transformer method was deprecated as it should be been when this extra parameter was added.
- Set the strictMimeTypeCheck and retryTransformOnDifferentMimeType properties on dynamically created
pipeline and fail over transformers from global properties as is don for static transformers.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@128569 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Addition of AbstractContentTransformer2.strictMimeTypeCheck(...) to check that the
declared mimetype is the same as the Tika derived mimetype. Controlled by a new alfresco
global property content.transformer.strict.mimetype.check (set to true by default).
- Added the UnsupportedTransformationException to a number of transformation callers as
they should have been handling it already.
- Modified DifferrentMimeTypeTest to test the new strictMimeTypeCheck.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@128168 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
123502 adavis: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
123484 adavis: MNT-14295 TransformerConfigLimits Behavior Differs on Java 7 vs 8
- Fixed TransformerPropertyNameExtractor so that it discarded limit or
max properties if its sibling max or limit value is supplied and has
a lower value (i.e. it should be used).
- Cleaned up TransformerConfigLimits.setTransformationLimitsFromProperties
- This has been a problem since 4.2.
- The problem only came to light with the switch to Java 8, as the order in which
transformer properties are loaded changed. They were being loaded in MapEntry order.
The problem did not show up in the unit tests, because they too were impacted by
the MapEntry order and ironically loaded the properties in the same order.
- Added unit tests that are supply both max an limit values.
- Enhanced the debug and toString methods in order to trace the problem.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123711 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
122983 rmunteanu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
122910 rmunteanu: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4)
122808 adavis: MNT-15738 Property to disable transformers by use (e.g. doclib thumbnail generation) in 4.2 is not working
- Allow 'use' style properties to be set
- Combine 'use' properties more flexibly with the default ones.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123686 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
107541: Merged 5.0.N (5.0.3) to HEAD-BUG-FIX (5.1/Cloud) (PARTIAL MERGE)
107413: Merged DEV to 5.0.N (5.0.3)
106858 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Cleaning of Javadoc,
107565: MNT-13545 Fix compilation after merge of Javadoc
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@107633 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
103441: Merged 5.0.N (5.0.2) to HEAD-BUG-FIX (5.1/Cloud)
103246: Merged V4.2-BUG-FIX (4.2.5) to 5.0.N (5.0.2)
103035: Merged NESS/4.2.N-2015_03_12 (4.2.5) to V4.2-BUG-FIX (4.2.5)
102240: MNT-13531: EMLTransformer ignoring multipart emails
- used htmlparser to extract text from html mail part
- added test to check if html special chars appear in transformation result
102375: MNT-13531: EMLTransformer ignoring multipart emails
- use plain/text representation if present, prior to html representation on multipart/alternative parts
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@103625 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
102706: Merged 5.0.N (5.0.2) to HEAD-BUG-FIX (5.1/Cloud)
102487: Merged V4.2-BUG-FIX (4.2.5) to 5.0.N (5.0.2)
101885: Merged DEV to V4.2-BUG-FIX (4.2.5)
101133: MNT-13531: EMLTransformer ignoring multipart emails
- updated transfer in order to identify multiparts and correctly process them
- added tests
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@103536 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
99981: Merged 5.0.N (5.0.2) to HEAD-BUG-FIX (5.1/Cloud) (PARTIAL MERGE)
99482: Merged DEV to 5.0.N (5.0.1)
99198 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc typo in project alfresco-jlan
99413 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project alfresco-jlan
99205 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project core
99415 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project data-model
99227 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project file-transfer-receiver
99416 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project legacy-lucene
99417 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project qa-share
99418 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project remote-api
99427 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc in project Repository, letters S..Z
99433 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc in project Repository, letters A..R
99421 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project share-po
99247 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc typo in project slingshot
99248 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project slingshot
99424 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project solr
99426 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project solr4
99253 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project solr-client
99259 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project web-client
99260 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Changed Javadoc parameters inconsistence in project web-framework-commons
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@100501 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
92488: Merged 5.0.N (5.0.1) to HEAD-BUG-FIX (5.1/Cloud)
92487: Merged V4.2-BUG-FIX (4.2.5) to 5.0.N (5.0.1)
92486: MNT-13015 Some Transformer tests failing in JDK8
- TransformerDebug was listing unsupported transformers in different ways in Java 7 & 8. Had no impact on usage, but broke a test.
- Fixed a logic error highlighted by the switch to Java 8 in 5.1 to do with the default supported or unsupported values if there
are some supported or unsupported properties specified, but the mimetype transformation being checked was not one of them.
I don't think this will have come up causing any problems with the standard configuration defined in transformations.properties,
but might in customisations created by customers, which is why this has been committed to 4.2.N.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@94886 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
89429: ACE-2649: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
89386: Merged DEV to 4.2-BUG-FIX (4.2.4)
85587: MNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@94626 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
85648: MNT-12422: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
85647: Merged V4.1-BUG-FIX (4.1.10) to V4.2-BUG-FIX (4.2.4)
85646: MNT-11823: Upload of PPTX causes very high memory usage leading to system instability
- Need to reduce the timeout (50ms -> 30ms) of the test as it failed on HEAD-BUG-FIX after a new version of POI was merged from V4.2-BUG-FIX
as it completed the transform in 52ms which then tripped over the check that its should not have completed.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@85678 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
84255: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
84254: Reverse Merge V4.2-BUG-FIX (4.2.4)
<< Causes problems for other transformations such as those that start with an HTML file >>
77398: MNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit testMNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@84651 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
84058: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
83799: MNT-12238: Merged DEV 4.2-BUG-FIX (4.2.4) to V4.2-BUG-FIX (4.2.4)
MNT-12238: Merged 4.1-BUG-FIX (4.1.10) to V4.2-BUG-FIX (4.2.4)
80291: Merged V4.1.6 (4.1.6.21) to V4.1-BUG-FIX (4.1.10)
77378: Merged DEV PATCHES/V4.1.6 (19) to PATCHES/V4.1.6 (20)
76649: MNT-11823: Upload of PPTX causes very high memory usage leading to system instability
- Patch from MNT-577 has been combined with new changes to avoid hanging of analyzing complicated PPTX documents. The fix just disables reading the entire contents of the complicated document. POI metadata extractor may be switched to standard behavior or reconfigured, using the following new properties: content.transformer.Poi.poiFootnotesLimit, content.transformer.Poi.poiExtractPropertiesOnly and content-services-context.xml/extracter.Poi/poiAllowableXslfRelationshipTypes
77379: MNT-11823: Upload of PPTX causes very high memory usage leading to system instability
Test and the test data for MNT-577 have been added. Test for MNT-11823 has also been added. But this test is commented because the test data (appropriate PPTX document) is not currently available. Getters for POI specific properties have been added to 'PoiMetadataExtracter' for tests. Also 'afterPropertiesSet()' logic has been a bit modified to allow setting 'false' value for 'poiExtractPropertiesOnly' parameter
77561: MNT-11823: Upload of PPTX causes very high memory usage leading to system instability
Fix for https://bamboo.alfresco.com/bamboo/browse/HF-PATCH416-126 build failure. POI extractor and transformer properties of 'AlfrescoPoiPatchUtils' have been isolated from each other using context. Each extractor or transformer now has its own context or uses the default context. Properties of the default context allow parsing the entire contents of XLSF documents. And footnotes limit is 50. Property names have not been changed, but currently 'content-services-context.xml/extracter.Poi/poiAllowableXslfRelationshipTypes=null' does not lead to 'content.transformer.Poi.poiExtractPropertiesOnly=false'. I. e., this list may be empty. 'PoiMetadataExtracterTest' test has been modified in accordance with the introduced changes. 'poi-OOXML-3.9-beta1-20121109.jar' has been renamed to 'poi-OOXML-3.9-beta1-20121109-patched.jar'
79180: MNT-12043: CLONE - Upload of PPTX causes very high memory usage leading to system instability
Timeout mechanism has been added to content transformers. Timeout configuration options have been added. Also mechanism to close streams after 'TimoutException' has been added to transformers and metadata extractors. Also timeout mechanism for input streams has been enabled in 'TikaPoweredContentTransformer'
79268: MNT-12043: CLONE - Upload of PPTX causes very high memory usage leading to system instability
Fix for the https://bamboo.alfresco.com/bamboo/browse/HF-PATCH416-133 build failure and comments of the review https://fisheye.alfresco.com/cru/CR-100#CFR-1184. The new test has been added into 'PoiOOXMLContentTransformerTest.testMnt12043()' to check out the newly added timeout mechanism
79290: MNT-12043: CLONE - Upload of PPTX causes very high memory usage leading to system instability
- Removed methods and properties that are no longer needed
79327: MNT-12043: CLONE - Upload of PPTX causes very high memory usage leading to system instability
- Increased ADDITIONAL_PROCESSING_TIME to 1500ms to try and avoid a new intermittent test failure.
83885: MNT-12238 Bring Maven POM file in sync with latest patched version of poi-ooxml
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@84627 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
78913: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
78866: Merged V4.1-BUG-FIX (4.1.10) to V4.2-BUG-FIX (4.2.4)
78729: MNT-9257: Incorrect behaviour with password protected MS office documents
Check if failover transformers support transformation for mimetype before calling it. Add unit test.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@82655 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
78537: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
77398: MNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit testMNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during convertion.
- Add unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@82590 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
68525: Merged V4.2-BUG-FIX (4.2.3) to HEAD-BUG-FIX (4.3/Cloud)
68281: MNT-11350: Upgrade of Tika to 1.6-SNAPSHOT
- Upgraded patched Tika, patched PDFBox, and other dependencies
- Added a MediaTypeDisablingDocumentSelector to disable parsing of certain embedded attachments
- Added TikaPoweredMetadataExtracter.buildParseContext method which sets the context's DocumentSelector if specified
- Added setting of DocumentSelector if specified to TikaPoweredContentTransformer
- Added MediaTypeDisablingDocumentSelector to the transformer.PdfBox bean config which disables parsing of embedded images
- Updated PdfBoxContentTransformerTest to disable the parsing of embedded images
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@70407 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
68226: Merged V4.2-BUG-FIX (4.2.3) to HEAD-BUG-FIX (4.3/Cloud)
68000: MNT-11015: Content indexing doesn't have transformation failover implementation
- Failed unit test is fixed.
- quick-differentMimetype.docx MIME type is changes from "application/msword" to "application/pdf"
- If required transformer isn't found the more detailed message will be displayed into AssertionFailedError.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@68474 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
57475: Merged V4.2-BUG-FIX (4.2.1) to HEAD-BUG-FIX (Cloud/4.3)
57285: MNT-9854 (ALF-17856) Ability to Easily Disable Certain Source and Target Media Types in ImageMagick Transformer
- Added a unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@61817 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Modified the way content.transformer.<transformer name>.extensions.<source>.<target>*.*.supported=[true|false] work
so they are still backward compatible AND one can define if unmatched mimetypes are supported or not. For example:
content.transformer.remoteServer.extensions.*.*.supported=false
- Added properties to transformers.properties so that the Transformation server ONLY supports the transformations
it did in 4.1.5 AND allows customers to change just one property to allow the Transformation Server to do all
transformations that it can with the exception of those that go to text, as generally the default transformers do
a better job and are faster.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@55704 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261