Oliver S. Posted December 6, 2023
After updating to V2.x.x, large PDFs over 1 GB in size cannot be opened in Publisher or Designer. Trying to place them into a document also fails and crashes the program. The files are between 3 and 5.4 GB. My goal was to turn them into documents I can work with. The RAM limit in the app was set appropriately for the RAM available on the machine. Example: 16 GB system RAM available -> max. 10 GB for the Publisher app.
Location Germany, Timezone CEST Berlin. macOS, and soon additionally Win11, using Affinity Photo, Publisher and Designer
Hangman Posted December 6, 2023
Hi @Oliver S., the first obvious question: could you upload one of the smaller files so we can take a look? I'm unsure what the file size limit for forum uploads is, but I know files in excess of 1 GB have been uploaded successfully in the past. Without a sample file I think it will be difficult to diagnose potential causes of the crash, though a crash report may also help.
Affinity Designer 2.6.3 | Affinity Photo 2.6.3 | Affinity Publisher 2.6.3 MacBook Pro M3 Max, 36 GB Unified Memory, macOS Sonoma 14.6.1, Magic Mouse HP ENVY x360, 8 GB RAM, AMD Ryzen 5 2500U, Windows 10 Home, Logitech Mouse
Oliver S. Posted December 6, 2023
Hi Hangman, uploading these files to the forum would be a bit of a nightmare. What I can do is open one of the files again in Publisher and paste the Apple crash report here. Here is the link to the files from the German Federal Archives (website operated by the Federal Republic of Germany). In my case it was 'BD2' on the right-hand side, which I use for research work. Be warned, the download takes a long time because of very poor download rates.
https://www.bundesarchiv.de/DE/Content/Artikel/Benutzen/Hinweise-zur-Benutzung/Unterseiten-Militaer/Militaerische-Verbaende-und-Einheiten/benutzen-speziell-milit-verbaende-einheiten-tessin.html
The PDF files are OCR scans of books from the German Military Archive. The idiot who scanned them did it in colour instead of b/w and included a lot of space that does not belong to the book.
I found a workaround for the problem. Open the 350-page file with "Preview", delete 300 pages and rename it "***1-50.pdf". Copy the complete file into the same folder again, delete everything except the next 50 pages and rename it "***51-100.pdf". Each file is then around 750 MB and can be opened, and it is possible to convert the file to b/w, remove the incorrect OCR data and resize it.
regards Oliver
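The manual Preview workaround above could also be scripted. What follows is only a minimal sketch, assuming Ghostscript is installed as gs and can read the source file (nothing in the post above confirms either). The function names chunk_ranges and split_pdf are hypothetical, and the 350-page count and 50-page chunk size are simply the figures from the post. -dFirstPage and -dLastPage are standard Ghostscript switches for selecting a page range.

```shell
#!/bin/sh
# Print "first last" page ranges covering pages 1..total in chunks of size.
# (chunk_ranges is a hypothetical helper name, not part of any tool.)
chunk_ranges() {
    total=$1
    size=$2
    first=1
    while [ "$first" -le "$total" ]; do
        last=$((first + size - 1))
        # Clamp the final chunk to the last page of the document.
        [ "$last" -gt "$total" ] && last=$total
        echo "$first $last"
        first=$((last + 1))
    done
}

# Write one PDF per range, named like input_1-50.pdf, input_51-100.pdf, ...
# Assumes Ghostscript is on the PATH as "gs".
split_pdf() {
    input=$1
    total=$2
    size=$3
    chunk_ranges "$total" "$size" | while read -r first last; do
        gs -sDEVICE=pdfwrite -dNOPAUSE -dBATCH \
           -dFirstPage="$first" -dLastPage="$last" \
           -sOutputFile="${input%.pdf}_${first}-${last}.pdf" "$input"
    done
}

# Example call (file name and page count are assumptions):
# split_pdf Bd_2_ocr.pdf 350 50
```

Ghostscript rewrites each chunk as a new PDF, so the resulting file sizes will not necessarily match the roughly 750 MB per chunk reported for the Preview method.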
walt.farrell Posted December 6, 2023
14 minutes ago, Oliver S. said: I found a workaround for the problem. Opening the 350 pages files with "Preview" delete 300 pages and rename it "***1-50.pdf"
Have you tried opening the full file in Publisher and, when prompted for what to load, telling it to load pages 1-50?
-- Walt Designer, Photo, and Publisher V1 and V2 at latest retail and beta releases PC: Desktop: Windows 11 Pro 23H2, 64GB memory, AMD Ryzen 9 5900 12-Core @ 3.00 GHz, NVIDIA GeForce RTX 3090 Laptop: Windows 11 Pro 23H2, 32GB memory, Intel Core i7-10750H @ 2.60GHz, Intel UHD Graphics Comet Lake GT2 and NVIDIA GeForce RTX 3070 Laptop GPU. Laptop 2: Windows 11 Pro 24H2, 16GB memory, Snapdragon(R) X Elite - X1E80100 - Qualcomm(R) Oryon(TM) 12 Core CPU 4.01 GHz, Qualcomm(R) Adreno(TM) X1-85 GPU iPad: iPad Pro M1, 12.9": iPadOS 18.5, Apple Pencil 2, Magic Keyboard Mac: 2023 M2 MacBook Air 15", 16GB memory, macOS Sequoia 15.5
Oliver S. Posted December 6, 2023
Hello Walt, unfortunately I did try to open the full file. As described, I also tried placing it into a document, with the same bad result. Where do I find these 'PDF Options'? I have never seen them in Publisher; it looks like they could help too 😀. Tomorrow I will set up a Windows 11 machine, then we'll be on the same platform. Normally you'll find me in the Apple corner.
regards Oliver
walt.farrell Posted December 6, 2023
54 minutes ago, Oliver S. said: Where do i find these 'PDF Options' ..... never seen this in Publisher, looks that this can help also
They're from File > Open.
Oliver S. Posted December 7, 2023
Hello Walt, there are no such options in "Open file", see the attached screenshot. Maybe it is a special option for Windows-based systems, or it has to be activated in the preferences. The macOS crash report from opening the PDF file is attached.
Attachment: Affinity Publisher crash report - large files.txt
Hangman Posted December 7, 2023
Hi @Oliver S., I see an instant crash when attempting to open Bd_2_ocr.pdf in Publisher v1.10.8, v2.0.4 and v2.3.0. This is also why you're not seeing the options @walt.farrell shows in his screengrab: the file crashes Publisher before the PDF Options window appears.
Crash reports: Affinity Publisher-2023-12-07-094845.ips, Affinity Publisher 2-2023-12-07-094804.ips, Affinity Publisher 2 Affinity Store-2023-12-07-094736.ips
16 hours ago, Oliver S. said: The idiot who scanned it did it in colour instead of b/w and includes a lot of space which did not belongs to the book.
Taking a different approach, I converted the PDF to a greyscale version using Ghostscript, which reduces the file size from 5.37 GB to 235 MB. It's one to set running while you go off to make a cup of coffee, but you will see the progress page by page:
gs -sOutputFile=Bd_2_ocr_Greyscale.pdf -sDEVICE=pdfwrite -sColorConversionStrategy=Gray -dProcessColorModel=/DeviceGray -dAutoRotatePages=/None -dCompatibilityLevel=1.7 -dNOPAUSE -dBATCH Bd_2_ocr.pdf
The scanning is a bit silly, as the pages could have been cropped, though that is easy enough to do in Publisher or Apple Preview. The converted version is attached and opens quite happily in Publisher. If you need a different PDF version, simply change -dCompatibilityLevel=1.7 accordingly, e.g. to -dCompatibilityLevel=1.4. You can likewise remove redundant pages using the command line rather than doing so in Publisher, which would reduce the file size further.
On a side note, you will notice that because the book wasn't perfectly straight when scanned, the invisible OCR text layer is broken into small chunks on some pages, which makes it difficult to copy and paste the text if that is the intention. If you happen to have an iPhone or iPad running iOS 15 or later, you will find Live Text extremely helpful, as it allows you to copy the text of entire pages and paste it as live, fully editable text in a new document. Let us know if this helps at all...
Attachment: Bd_2_ocr_Greyscale.pdf
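The two suggestions in the post above (a different compatibility level, and dropping redundant pages on the command line) can be combined with the greyscale conversion in one call. A hedged sketch only: it reuses the switches from Hangman's command and adds Ghostscript's -dFirstPage and -dLastPage page-range switches. The function name grey_pages and the GS override variable are hypothetical conveniences, and the page numbers in the example are placeholders, not values from the thread.

```shell
#!/bin/sh
# GS names the Ghostscript binary; override it for testing or for a
# differently named install (hypothetical convenience variable).
GS=${GS:-gs}

# grey_pages INPUT OUTPUT FIRST LAST [PDF_LEVEL]
# Convert pages FIRST..LAST of INPUT to greyscale and write them to OUTPUT.
# PDF_LEVEL defaults to 1.7, the level used in the post above.
grey_pages() {
    input=$1
    output=$2
    first=$3
    last=$4
    level=${5:-1.7}
    "$GS" -sDEVICE=pdfwrite -dNOPAUSE -dBATCH \
        -sColorConversionStrategy=Gray -dProcessColorModel=/DeviceGray \
        -dAutoRotatePages=/None -dCompatibilityLevel="$level" \
        -dFirstPage="$first" -dLastPage="$last" \
        -sOutputFile="$output" "$input"
}

# Example call (page range is a placeholder):
# grey_pages Bd_2_ocr.pdf Bd_2_ocr_Greyscale_p10-340.pdf 10 340 1.4
```

A single range only trims pages from the start and end of the document; removing pages from the middle would need several runs, one per range.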
Oliver S. Posted December 7, 2023
Hi Hangman, thanks for the pointer to Ghostscript, which I didn't know. This will help a lot. It seems there was also an issue with the PDF version. Now it is possible to choose the options Walt mentioned when opening the file. Working on the document, such as removing the bad OCR data and resizing or cropping the pages, is now much easier. Many thanks 🙂
regards Oliver
Hangman Posted December 7, 2023
Hi @Oliver S., that's no problem at all, happy to help. The original file is a PDF 1.7 file, as is the version I converted to greyscale. It would still be helpful to understand why the file causes Publisher to crash instantly, so I'm hoping someone on the moderation team will be able to shed some light on that.
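For anyone wanting to check the PDF version of their own files: a PDF begins with a header line of the form %PDF-1.7, so the declared version can be read from the first few bytes. This is a rough sketch, and the function name pdf_header_version is made up for illustration. It reads only the header; a PDF can also declare a later version in its document catalog, which this does not inspect.

```shell
#!/bin/sh
# Print the version declared in a PDF's header line, e.g. "1.7".
# Reads only the first 8 bytes ("%PDF-x.y"); prints nothing if the
# file does not start with a PDF header.
pdf_header_version() {
    head -c 8 "$1" | sed -n 's/^%PDF-\([0-9]\.[0-9]\).*/\1/p'
}

# Example call (file names from the thread):
# pdf_header_version Bd_2_ocr.pdf
# pdf_header_version Bd_2_ocr_Greyscale.pdf
```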
Dan C Posted December 7, 2023
Thanks for your report @Oliver S.! I can confirm I've been able to replicate this crash and have reported it to our development team for further investigation, as I cannot currently see a clear reason for it. Interestingly, on Windows the Affinity app shows a 'File type not supported' error when trying to open this PDF. I've been able to import a different large PDF document (3.8 GB) without this crash occurring, so I don't believe it's specifically related to the size of the PDF file itself. It's interesting to know, though, that after resaving a reduced version of this PDF through Preview, Affinity was able to import the document, and I'll be sure to include this in the development log. Many thanks to @Hangman for the additional information and workaround provided above!