* Support the creation of thumbnails for 2008/9 and 2013/14 iWorks file types.
* Support the preview of 2008/9 and 2013/14 iWorks file types. Note that the embedded PDF
files from 2008/9 are no longer used for the preview, so the quality is not as good as it
was in the past. The preview now uses the embedded JPEG, and the quality is reasonable.
* AppleIWorksContentTransformer no longer supports PDF as a target mimetype, as the newer
iWorks 2013/14 formats no longer include an embedded PDF file. Having PDF supported by the
transformer resulted in Share always trying PDF, which was one of the reasons that the newer
formats did not preview. The other reason was that the embedded JPEG filename also changed.
* Added 'quick' files for testing.
* Also modified the exception message thrown when the iWorks file does not contain an embedded
preview file, which is optional. An example is "The source numbers file did not contain
a jpg preview", rather than "Unable to transform numbers file to jpg". This should reduce the
number of issues raised with support.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@135869 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
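Illustration only (not part of the commit): a minimal sketch of how the embedded JPEG preview lookup described above might work. The entry names `QuickLook/Thumbnail.jpg` and `preview.jpg`, the class name and the helper `zipOf` are assumptions made for the example; only the exception message text comes from the commit message.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import java.util.zip.ZipOutputStream;

class IWorksPreviewSketch {
    // Assumed entry names: an older (2008/9) layout and a newer (2013/14) layout.
    static final List<String> JPEG_ENTRIES = List.of("QuickLook/Thumbnail.jpg", "preview.jpg");

    // Returns the bytes of the first matching embedded JPEG, or fails with a
    // message that says the preview is missing rather than "unable to transform".
    static byte[] extractJpegPreview(byte[] zipBytes, String sourceType) {
        try (ZipInputStream zip = new ZipInputStream(new ByteArrayInputStream(zipBytes))) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                if (JPEG_ENTRIES.contains(entry.getName())) {
                    return zip.readAllBytes(); // reads to the end of the current entry
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        throw new IllegalStateException(
                "The source " + sourceType + " file did not contain a jpg preview");
    }

    // Test helper (hypothetical): builds a one-entry zip in memory.
    static byte[] zipOf(String entryName, byte[] data) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (ZipOutputStream zip = new ZipOutputStream(bytes)) {
            zip.putNextEntry(new ZipEntry(entryName));
            zip.write(data);
            zip.closeEntry();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return bytes.toByteArray();
    }
}
```

Because the preview is optional, a file without either entry is a normal case, which is why the sketch reports it with the more specific message.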
- part 2 - enable "addTags" to handle configurable list of separators (when using "enableStringTagging" option of "extract-metadata" action)
- initial default separators/delimiters => comma, semi-colon & vertical bar (pipe)
- also means we can re-enable & fix ContentMetadataExtractorTagMappingTest
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@135061 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
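Illustration only: a minimal sketch of splitting a single extracted keyword string on a configurable list of separators, using the default comma, semi-colon and vertical bar named above. The class and method names are hypothetical and are not the actual "addTags" code.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

class TagSplitSketch {
    // Defaults taken from the commit message: comma, semi-colon, vertical bar.
    static final List<String> DEFAULT_SEPARATORS = List.of(",", ";", "|");

    static List<String> splitTags(String raw, List<String> separators) {
        List<String> tags = new ArrayList<>();
        if (separators.isEmpty()) {
            // No separators configured: treat the whole value as one tag.
            if (!raw.trim().isEmpty()) tags.add(raw.trim());
            return tags;
        }
        // Quote each separator so characters such as '|' are taken literally.
        StringBuilder regex = new StringBuilder();
        for (String separator : separators) {
            if (regex.length() > 0) regex.append('|');
            regex.append(Pattern.quote(separator));
        }
        for (String part : raw.split(regex.toString())) {
            String tag = part.trim();
            if (!tag.isEmpty()) tags.add(tag); // drop empty fragments
        }
        return tags;
    }
}
```

For example, `splitTags("java, pdf;preview | thumbnails", DEFAULT_SEPARATORS)` yields the four tags `java`, `pdf`, `preview` and `thumbnails`.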
<<< Root cause for these 4 issues was a fix made in 4.2.4 MNT-11279. That fix was trying to
use the file name field value in a document when transformed to PDF. In addition to the
3 problems found, using file names supplied by users is a potential security issue.
We have not found a way to exploit the use of the original file name, but that should
not stop us from closing off the possibility. Fix is being merged to the dot N branches
of 5.2, 5.1, 5.0 and 4.2 so the fix will be in the next service pack for each branch.
The original MNT-16609 fix is also being reverted as the original code included the source
extension in the temporary file name. >>>
Reverse merged 5.2.N (5.2.1)
129407 skopf: MNT-16609 - Alfresco OOTB install doesn't show preview for .EML files on Windows
94626 adavis: Merged HEAD-BUG-FIX (5.1/Cloud) to HEAD (5.1/Cloud)
89429: ACE-2649: Merged V4.2-BUG-FIX (4.2.4) to HEAD-BUG-FIX (5.0/Cloud)
89386: Merged DEV to 4.2-BUG-FIX (4.2.4)
85587: MNT-11279: Docx Preview: FileName in the document header not previewing correctly
- Keep original name during conversion.
- Add unit test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@133715 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- StringExtractor is now able to extract text so searching is now working, which was the main reason for this Customer issue.
- PdfBox.TextToPdf is now able to transform .dita files to .pdf. As a result it is also possible to preview the files (although the xml tags are visible) and
to generate .png thumbnails using the complex.Text.Image transformer (which has an intermediate .pdf stage).
- As expected TikaAuto (or our patched 1.6 version) does not know about the DITA mimetype application/dita+xml so is not claiming to transform this type even
though it is just an XML file.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@130609 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Addition of a white list of Tika derived mimetypes and declared mimetypes that are deemed to be safe (e.g. for .ai and .pdf).
- Based on Andrei's patch (see CR-546), but with the strictMimetypeCheck configuration and logic moved to TransformerConfig
- Rework of DifferrentMimeTypeTest (was not in the patch) as it had become hard to understand.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@130139 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
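Illustration only: a minimal sketch of a strict mimetype check with a white list of declared/detected pairs that are deemed safe. The map layout, the single `application/illustrator` to `application/pdf` entry and all names are assumptions for the example, not the TransformerConfig implementation.

```java
import java.util.Map;
import java.util.Set;

class StrictMimetypeSketch {
    // Assumed shape: declared mimetype -> detected mimetypes accepted despite the mismatch.
    // The single entry models the ".ai and .pdf" case mentioned in the commit message.
    static final Map<String, Set<String>> WHITE_LIST =
            Map.of("application/illustrator", Set.of("application/pdf"));

    static boolean isTransformAllowed(String declared, String detected, boolean strictCheck) {
        if (!strictCheck || declared.equals(detected)) {
            return true; // check disabled, or the two mimetypes agree
        }
        // Mismatch: only allowed when the pair is explicitly white listed.
        return WHITE_LIST.getOrDefault(declared, Set.of()).contains(detected);
    }
}
```

With the strict check enabled, a file declared as `application/illustrator` but detected as `application/pdf` is accepted, while any other mismatch is rejected.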
- Ensure error when indexer is the caller includes the file name rather than null.
NodeContentGet now calls the transform method with a TransformationOptions parameter, and the original
transformer method has been deprecated, as it should have been when this extra parameter was added.
- Set the strictMimeTypeCheck and retryTransformOnDifferentMimeType properties on dynamically created
pipeline and failover transformers from global properties, as is done for static transformers.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@128569 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
- Addition of AbstractContentTransformer2.strictMimeTypeCheck(...) to check that the
declared mimetype is the same as the Tika derived mimetype. Controlled by a new alfresco
global property content.transformer.strict.mimetype.check (set to true by default).
- Added the UnsupportedTransformationException to a number of transformation callers as
they should have been handling it already.
- Modified DifferrentMimeTypeTest to test the new strictMimeTypeCheck.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@128168 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
125892 adragoi: Merged 5.0.N (5.0.4) to 5.1.N (5.1.2)
125842 rmunteanu: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4) (PARTIAL MERGE)
125700 adavis: Merged V4.2.5 (4.2.5.7) to V4.2-BUG-FIX (4.2.7)
125698: Merged DEV to V4.2.5 (4.2.5.7)
125677 arebegea: MNT-15219 : Excel (.xlsx) containing xmls (shapes/drawings) with multi byte characters may cause OutOfMemory in Tika
- Should not have updated version.properties as the original commit needs to be merged forwards.
125696: Merged DEV to V4.2.5 (4.2.5.7)
125677 arebegea: MNT-15219 : Excel (.xlsx) containing xmls (shapes/drawings) with multi byte characters may cause OutOfMemory in Tika
- Modified tika parser and tika core jars to allow some configuration parameters to be sent from Alfresco side using the metadata map parameter
- Excluded by default the parsing of drawings/shapes xmls because there was little valuable data that could be extracted from those xmls
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@126004 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
124313 adragoi: Merged 5.0.N (5.0.4) to 5.1.N (5.1.2)
124244 abalmus: MNT-15497 : Keyword tags generated from metadata extraction are formed into a single string rather than split on delimiter
- Fixed tag separation on delimiter
- Enhanced existing test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@124364 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
124145 amorarasu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.2)
124106 rmunteanu: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4)
124105 amorarasu: Merged V4.1-BUG-FIX (4.1.11) to V4.2-BUG-FIX (4.2.7)
123915 rmunteanu: Merged V4.1.7 (4.1.7.18) to V4.1-BUG-FIX (4.1.11)
MNT-15801 : onContentUpdate policies not working for updates via webdav
- Reverted the changes made for MNT-11185 and MNT-10966; these reverts fix MNT-15746
- Changed the fix for MNT-5882 by adding the ASPECT_NO_CONTENT after the content is updated, which prevents MNT-10966 from reproducing
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/BRANCHES/DEV/5.2.N/root@124228 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
123502 adavis: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
123484 adavis: MNT-14295 TransformerConfigLimits Behavior Differs on Java 7 vs 8
- Fixed TransformerPropertyNameExtractor so that it discards a limit or
max property if its sibling max or limit value is supplied and has
a lower value (i.e. the lower value should be used).
- Cleaned up TransformerConfigLimits.setTransformationLimitsFromProperties
- This has been a problem since 4.2.
- The problem only came to light with the switch to Java 8, as the order in which
transformer properties are loaded changed. They were being loaded in MapEntry order.
The problem did not show up in the unit tests, because they too were impacted by
the MapEntry order and ironically loaded the properties in the same order.
- Added unit tests that supply both max and limit values.
- Enhanced the debug and toString methods in order to trace the problem.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123711 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
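Illustration only: a minimal sketch of resolving a max/limit pair so that the result does not depend on the order in which the properties were loaded. The keys `max` and `limit`, the tie-break in favour of `max` and the class name are assumptions for the example, not the TransformerPropertyNameExtractor code.

```java
import java.util.LinkedHashMap;
import java.util.Map;

class LimitPairResolutionSketch {
    // Keeps only the lower of the two sibling values, whatever the load order was.
    static Map<String, Long> resolve(Map<String, Long> loaded) {
        Long max = loaded.get("max");
        Long limit = loaded.get("limit");
        Map<String, Long> effective = new LinkedHashMap<>();
        if (max != null && (limit == null || max <= limit)) {
            effective.put("max", max);     // assumption: ties go to max
        } else if (limit != null) {
            effective.put("limit", limit);
        }
        return effective;
    }

    public static void main(String[] args) {
        Map<String, Long> maxFirst = new LinkedHashMap<>();
        maxFirst.put("max", 50L);
        maxFirst.put("limit", 20L);

        Map<String, Long> limitFirst = new LinkedHashMap<>();
        limitFirst.put("limit", 20L);
        limitFirst.put("max", 50L);

        // Both load orders give the same effective setting.
        System.out.println(resolve(maxFirst).equals(resolve(limitFirst)));
    }
}
```

The point of the sketch is that the decision is made from the values alone, so iteration order (which changed between Java 7 and Java 8 for hash-based maps) cannot affect it.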
123030 rmunteanu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
122981 rmunteanu: MNT-15167 : PageLimit value no longer pick up in 5.x with or without Transformation Server installed
- Preserved transformer properties order.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123692 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
122983 rmunteanu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
122910 rmunteanu: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4)
122808 adavis: MNT-15738 Property to disable transformers by use (e.g. doclib thumbnail generation) in 4.2 is not working
- Allow 'use' style properties to be set
- Combine 'use' properties more flexibly with the default ones.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123686 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
122665 aleahu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1) (PARTIAL MERGE)
122615 rneamtu: Merged DEV to 5.0.N (5.0.4)
122454 rneamtu: MNT-14602: Add config to support Visio 2013 previews and thumbnails
- Added support for vsdx mimetype
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123682 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
121797 rneamtu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
121765 nsmintanca: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4)
121735 rneamtu: 121670: MNT-13814 : Using MS word Save As pdf does not work correctly.
- Fixed failing test
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123652 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
121796 rneamtu: Merged 5.0.N (5.0.4) to 5.1.N (5.1.1)
121764 nsmintanca: Merged V4.2-BUG-FIX (4.2.7) to 5.0.N (5.0.4)
121709 rneamtu: Merged DEV to V4.2-BUG-FIX (4.2.7)
121670 rneamtu: MNT-13814 : Using MS word Save As pdf does not work correctly.
- Added new mimetype "application/applefile" and set it for files that starts with "._"
- Added new test for case
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@123651 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
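Illustration only: a minimal sketch of assigning the new "application/applefile" mimetype to files whose names start with "._". The method name, the fallback parameter and the path handling are assumptions for the example.

```java
class AppleFileMimetypeSketch {
    static final String MIMETYPE_APPLEFILE = "application/applefile";

    // Returns the applefile mimetype for "._" resource-fork style names,
    // otherwise whatever mimetype the caller had already worked out.
    static String guessMimetype(String fileName, String fallback) {
        int lastSeparator = Math.max(fileName.lastIndexOf('/'), fileName.lastIndexOf('\\'));
        String baseName = fileName.substring(lastSeparator + 1); // strip any folder part
        return baseName.startsWith("._") ? MIMETYPE_APPLEFILE : fallback;
    }
}
```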
122540 adragoi: Merged DEV to HEAD (5.1)
122252 ACE-5093 : Allow file content URLs to be generated by a provider
- implemented TimeBasedFileContentUrlProvider, VolumeAwareFileContentUrlProvider and test classes
121979 ACE-5093 : Allow file content URLs to be generated by a provider
- added deprecated annotation
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@122918 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
122252 ACE-5093 : Allow file content URLs to be generated by a provider
- implemented TimeBasedFileContentUrlProvider, VolumeAwareFileContentUrlProvider and test classes
121979 ACE-5093 : Allow file content URLs to be generated by a provider
- added deprecated annotation
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@122540 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
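Illustration only: a minimal sketch of what a pluggable, time-based content URL provider could look like. The `ContentUrlProvider` interface, the `store://year/month/day/hour/minute/id.bin` layout and all names here are assumptions for the example, not the signatures of the classes added by ACE-5093.

```java
import java.time.LocalDateTime;
import java.util.UUID;

class TimeBasedUrlSketch {
    // Hypothetical provider abstraction: the store asks it for each new content URL.
    interface ContentUrlProvider {
        String createNewUrl();
    }

    // Builds a URL from a timestamp and an identifier (assumed layout).
    static String timeBasedUrl(LocalDateTime when, String id) {
        return "store://" + when.getYear()
                + "/" + when.getMonthValue()
                + "/" + when.getDayOfMonth()
                + "/" + when.getHour()
                + "/" + when.getMinute()
                + "/" + id + ".bin";
    }

    // A provider that uses the current time and a random UUID.
    static ContentUrlProvider timeBasedProvider() {
        return () -> timeBasedUrl(LocalDateTime.now(), UUID.randomUUID().toString());
    }
}
```

Keeping URL generation behind an interface is what would let a second implementation (for example a volume-aware one) be swapped in without changing the store itself.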
107541: Merged 5.0.N (5.0.3) to HEAD-BUG-FIX (5.1/Cloud) (PARTIAL MERGE)
107413: Merged DEV to 5.0.N (5.0.3)
106858 : MNT-13545: JavaDoc : Inconsistencies between the Java doc and the actual code
- Cleaning of Javadoc
107565: MNT-13545 Fix compilation after merge of Javadoc
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@107633 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
106092: Merged 5.0.N (5.0.3) to HEAD-BUG-FIX (5.1/Cloud)
106073: Merged V4.2-BUG-FIX (4.2.5) to 5.0.N (5.0.3)
106001: Merged V4.1-BUG-FIX (4.1.11) to V4.2-BUG-FIX (4.2.5)
105937: Merged DEV to V4.1-BUG-FIX (4.1.11)
104782: MNT-13819 : Alfresco throws "TooBusyException" when uploading several files for transformation
- Close the TikaPoweredContentTransformer writer if the input stream fails
- TransfomerActionExecutor: do not use a shared transformationOption member for multithreaded transformation requests
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@106099 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
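Illustration only: a minimal sketch of the first point above, making sure the output side is closed even when reading the input fails. It uses plain `java.io` streams and a hypothetical helper name rather than the Alfresco reader and writer types.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.io.UncheckedIOException;

class CloseWriterSketch {
    // Copies in to out; try-with-resources closes out on success and on failure,
    // so a broken input stream cannot leave the output open.
    static void copyAndClose(InputStream in, OutputStream out) {
        try (OutputStream closing = out) {
            in.transferTo(closing);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```

Leaving an output open after a failed read is the kind of leak that can exhaust resources under load, which may be how the problem surfaced as a "TooBusyException"; that link is an inference, not something the commit message states.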
105991: Merged 5.0.N (5.0.3) to HEAD-BUG-FIX (5.1/Cloud)
105915: Merged V4.2-BUG-FIX (4.2.5) to 5.0.N (5.0.3)
105898: MNT-13901 TransformationOptions Timeout Can Not Easily Be Set
- Fix TransformationOptionPair.setMax(long max) and TransformationOptionPair.setLimit(long limit) so that they don't
clear the other half of the pair when the value being set is < 0 (unset).
- Unit tests added
- Added comments about how the 'transformer' code builds up limits to include defaults.
git-svn-id: https://svn.alfresco.com/repos/alfresco-enterprise/alfresco/HEAD/root@106013 c4b6b30b-aa2e-2d43-bbcb-ca4b014f7261
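Illustration only: a minimal sketch of the setter behaviour described above. It assumes that -1 means "unset" and that setting a real value on one half clears the other half (inferred from the commit message); the class is a stand-in, not the actual TransformationOptionPair.

```java
class OptionPairSketch {
    private long max = -1;   // -1 means unset
    private long limit = -1; // -1 means unset

    void setMax(long value) {
        if (value >= 0) {
            max = value;
            limit = -1; // a real max replaces any limit
        } else {
            max = -1;   // unsetting max leaves the limit untouched
        }
    }

    void setLimit(long value) {
        if (value >= 0) {
            limit = value;
            max = -1;   // a real limit replaces any max
        } else {
            limit = -1; // unsetting limit leaves the max untouched
        }
    }

    long getMax() { return max; }

    long getLimit() { return limit; }
}
```

With this shape, calling `setLimit(100)` followed by `setMax(-1)` leaves the limit at 100, which is the case the fix addresses.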