charlesbewlay Posted January 19, 2024 Posted January 19, 2024 I'm having a serious problem with importing a long document from Word with three heading levels. I imported what looks like a perfectly formatted Word document as per screenshot 1. What I get is as per screenshot 2. I've tried to fix this in many ways, changing styles in Word, and trying to fix in AfPub, over many hours and days. This is setting me badly behind (schedule.) But to no avail at all. I hope someone might be able to help me with this. I'd be ever so grateful, as I'm dreading to have to go back to an InD subscription. Charles P.S. The text being in blue is another matter, but I can fix that, despite remaining a mystery. Screenshot 1. png Quote
MikeTO Posted January 19, 2024 Posted January 19, 2024 I'd need to see a sample of the document to figure it out. I created this test document using the multi-level list feature of Word and it imported perfectly into Publisher. test.docx Quote Download a free PDF manual for Affinity Publisher 2.6 Download a quick reference chart for Affinity's Special Characters Affinity 2.6 for macOS Sequoia 15.5, MacBook Pro (M4 Pro) and iPad Air (M2)
Old Bruce Posted January 19, 2024 Posted January 19, 2024 I would think there is something in the Word DOCX file that has created the various Heading styles. I would search for in the Publisher file. Check in Word to see if those errant headings are defined. Quote Mac Pro (Late 2013) Mac OS 12.7.6 Affinity Designer 2.6.0 | Affinity Photo 2.6.0 | Affinity Publisher 2.6.0 | Beta versions as they appear. I have never mastered color management, period, so I cannot help with that.
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 12 hours ago, lacerto said: If your workflow is importing a Word document that is basically more or less fully style tagged, I would import that file in a Publisher document that has all default styles (not used) deleted. That would avoid having multiple styles with identical or nearly identical style names cluttered in the layout and confusing formatting of the document. Publisher does not have a capability to signal conflicting styles and let the user design at import time, whether to use Publisher-defined styles with the (matched/similar) style names, mapping the imported and existing styles, or overwriting Publisher styles with imported style definitions, and avoid what you describe as "style nightmare". So if you have a good arrangement in Word already, I would recommend importing into a document that is as much as possible cleared of in-built styles. EDIT: This would be a workable solution even if not having Word styles well-defined. Just having source text tagged with paragraph and character style names and finalizing the actual style definitions in Publisher, should work well. It is the style name conflicts that are probably the biggest nuisance in preparation of layout. Thanks lacerto. I've been deleting all Publisher styles before importing for a good while now: a lesson early learnt. I'll see what more I can do with the word styles now, and follow what the other guys responding suggest. Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 13 hours ago, MikeTO said: I'd need to see a sample of the document to figure it out. I created this test document using the multi-level list feature of Word and it imported perfectly into Publisher. test.docx 13 kB · 0 downloads Thanks Mike. I note you have used List Paragraph rather than Heading 1, 2 etc. And your sample is fine. I duplicated what you did and can even add first level. But in List Styles (screenshot) under numbering 'No list' is selected. But when I try to replicate that I have to select 1/1.1/1.11. Mighty mysterious. I attach my sample, the Word version and the AfPub version. Even though all changes are accepted in Word's Review, and in Tacking, no markup and nothing is selected in the options, AfPub seems to import a lot of chaos after the bottom of page 14, and also adds deleted pages in the prelims. NOne of that happens if I import the full document. Any way forward? BOCRA sample for forum.afpub Sample UNDERSTANDING COMMS FINAL GALLEY PROOF.docx Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 15 hours ago, Old Bruce said: I would think there is something in the Word DOCX file that has created the various Heading styles. I would search for in the Publisher file. Check in Word to see if those errant headings are defined. Nope, nothing like that in Word version. I also deleted unused styles, but Word keeps a great lot anyhow. It looks to me like Word on Windows has more control than on the Mac. looking at what lacerto just posted. Quote
iconoclast Posted January 20, 2024 Posted January 20, 2024 I would recommend - as I already learned in my apprenticeship - always to load only unformated text (e.g. saved as a *.txt-file, that doesn't allow formatings by design). Formatings often cause problems and should always be done in the DTP-Software, not in the text editor. Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 Well, as I learnt, as a book publisher, produce any number of galley proofs before dropping into pages. I've never had problems like I'm encountering now. Westerwälder 1 Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 1 hour ago, lacerto said: I downloaded your Word file and it has revision marks, which Affinity Publisher reads in. You should accept all revisions (on the Review tab) and stop tracking and then import the cleaned file: (Again, this is probably a bit different on macOS.) Note that Publisher also imports all hidden text so if you have obsolete styles there, these styles would also be imported. The original text had all accepted and was fine. the sample is crashing Word even after a restart, app and machine. I've now dropped the original into Publisher and cut 100+ pages to make a sample, so that's attached. Sample 2 BOCRA Pages.afpub Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 27 minutes ago, lacerto said: I had a look on the Word document that you posted, and when I tried to accept changes and stop tracking, Word stops responding. The same happens also on macOS Word, and LibreOffice Writer cannot handle the file, either. But I took it to Apple's Pages, which auto-accepted all tracking changes in tables and other places where it cannot do tracking, and then I accepted manually all changes in Pages, and removed also comments. You can find attached a cleaned file. I suggest that you do the same process yourself (Pages is free app on App Store) to see that what Pages does is correct. [EDIT: It does not seem so, so the original document might just be corrupt, and require manual cleaning.] Anyway, when I imported the cleaned document in Publisher, the style nightmare seems to be over. Sample UNDERSTANDING COMMS FINAL GALLEY PROOF_cleaned.docx 24.16 kB · 0 downloads Looks great what you've done! Thanks. Something wrong with the Word sample for sure. I'll try to replicate, but will be away for a couple of hours or so. Quote
iconoclast Posted January 20, 2024 Posted January 20, 2024 3 hours ago, charlesbewlay said: Well, as I learnt, as a book publisher, produce any number of galley proofs before dropping into pages. I've never had problems like I'm encountering now. Well, I'm not a book publisher. I'm a media designer. I learned to prepare images in Photoshop, create graphics in Freehand, later in Illustrator, and DTP using Quark XPress, later InDesign, about twenty years ago, and now I practice it in Publisher. And we always used to create the text in text editors first, save it unformated and then load it in the DTP-Software, to do all the layout work, including the formating, because this is a well ordered and reliable workflow, that prevents needless problems. Old Bruce 1 Quote
MikeTO Posted January 20, 2024 Posted January 20, 2024 I did a lot of testing and in a nutshell, importing multi-level lists from MS Word doesn't work. For Serif, this simple multi-level list in MS Word imports incorrectly into Publisher. With a blank document with all text styles deleted, place this test file into a frame. The list of styles will include some nonsense styles and the heading styles won't be properly defined so the lists will be broken. testing.docx For Serif, I found a second bug while looking at Charles' document. Tables form Word files aren't formatted with text styles after placing into Publisher, they are set to No Style. With a blank document with all text styles deleted, place this test file into a frame. The table text will be formatted as No Style after placing - it's formatted as Heading 2 in MS Word. test.docx For Charles: I believe the issue within being unable to scroll the style list is a known bug. Yes, you must accept all tracking changes before importing text into Publisher. I will add a tip to that effect in my manual. The sample Word file hung MS Word for me, too, when I tried to accept all changes, requiring a force quit. I fixed it with Pages as suggested above so I could play with it but I don't recommend that - Pages made a mess of the text styles and the headings became formatted with the Page Number style. It's going to take some effort to make this work in Publisher but here's how to do it. Open the document in Word. Add a temporary paragraph outside of the table and format it as style "Table Left". That style is used in your tables but nowhere else and because Publisher doesn't style the table text the text style won't be created in Publisher. You need this style so create a temporary paragraph formatted with the style to ensure the style is imported. Save the file. Place the modified file into Publisher. Delete that temporary paragraph and format the table text as "Table Left". This will solve the problem of all the table text and paragraphs styled as "TEXT" being formatted as lists. Go to the first paragraph numbered 0.1 and use Paragraph > Bullets and Numbering to fix it. Deselect Restart Numbering and change the list name from "6" to "2" which is the name Publisher assigned to the parent list. Now it will be numbered 1.2. Using the Text Styles panel, click the menu icon to the right of Heading 2 and choose Update Heading 2. For any other mis-numbered Heading 2 paragraphs, just re-apply Heading 2 to them to clear the formatting overrides. The next problem will be the first 0.0.1 paragraph. Deselect Restart Numbering and change the list name from "9" to "2". Using the Text Styles panel, click the menu icon to the right of Heading 3 and choose Update Heading 3. For any other mis-numbered Heading 3 paragraphs, just re-apply Heading 3 to them to clear the formatting overrides. This should clean it up although it will take some effort. Cheers Quote Download a free PDF manual for Affinity Publisher 2.6 Download a quick reference chart for Affinity's Special Characters Affinity 2.6 for macOS Sequoia 15.5, MacBook Pro (M4 Pro) and iPad Air (M2)
Old Bruce Posted January 20, 2024 Posted January 20, 2024 13 minutes ago, MikeTO said: For Serif, this simple multi-level list in MS Word imports incorrectly into Publisher. With a blank document with all text styles deleted, place this test file into a frame. The list of styles will include some nonsense styles and the heading styles won't be properly defined so the lists will be broken. I found this in the Edit Text Styles panel for heading 1. The heading 1 1 is set for Next style. If I set that Next style to Normal and then delete unused styles all the "nonsense styles" will disappear. Some caveats are that I don't have the various fonts defined in the various text styles and also I have to use Pages and LibreOffice because I do not own Word. Not having access to Word I cannot see if there is some thing set to create extra numbered lists. Quote Mac Pro (Late 2013) Mac OS 12.7.6 Affinity Designer 2.6.0 | Affinity Photo 2.6.0 | Affinity Publisher 2.6.0 | Beta versions as they appear. I have never mastered color management, period, so I cannot help with that.
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 7 hours ago, lacerto said: It is interesting that Pages seems to be able to handle the cleaned Word file just fine -- see attached the PDF that I exported from Pages. Sample UNDERSTANDING COMMS FINAL GALLEY PROOF.pdf 205.11 kB · 1 download However, if I try to import in Publisher, I get oddly messed up heading numberings, numbered paragraphs, etc. If I import in InDesign, results are better but not nearly the same as in Pages. UPDATE: I looked at the second Publisher sample you posted, and its heading numbering is incorrect, so the paragraph styles should be defined to have numbered lists with correct levels, and then the existing style assignments should be reapplied to get numbering right. UPDATE2: I fixed the major heading numbering styles (heading 1, heading 2 and heading 3) and automatically reapplied styles by using Find Replace (searching heading 1, and replacing with the same style, etc.): Sample 2 BOCRA Pages_fixed.afpub But there are many lists that still need to fixed, not just numbering but also formatting. Note that you can restart numbering from the currently selected paragraph by using the option in the Paragraph panel: n Lacerto, I'm gobsmacked!!! How can I ever thank you enough?? The other paras are a relatively easy fix. I'll sleep better tonight. I'd been trying the Pages route, but that also had problems. At least it now does footnotes, but no indexing (powerful in AfPub!). It seems like there is no import option, only Open, so that gives page layout problems, and using Convert to Page Layout just erases all text. Anyhow another story, but has it uses for sure, so thanks for reawaking me up to it as well. Quote
charlesbewlay Posted January 20, 2024 Author Posted January 20, 2024 4 hours ago, iconoclast said: Well, I'm not a book publisher. I'm a media designer. I learned to prepare images in Photoshop, create graphics in Freehand, later in Illustrator, and DTP using Quark XPress, later InDesign, about twenty years ago, and now I practice it in Publisher. And we always used to create the text in text editors first, save it unformated and then load it in the DTP-Software, to do all the layout work, including the formating, because this is a well ordered and reliable workflow, that prevents needless problems. Yes, we are in different worlds really. I'd do the same as you if graphics were a big part of what I do. I started with Pagemaker, and a bit of Freehand! Quote
Oufti Posted January 20, 2024 Posted January 20, 2024 4 hours ago, lacerto said: LibreOffice […] would therefore probably be the best tool to fix corrupted Word documents. It's definitely commonly advised to use LibreOffice to open and resave as .docx a corrupted Word file, as this software interprets quite well all Word features and re-encode them at export. For example: https://answers.microsoft.com/fr-fr/msoffice/forum/all/ouverture-du-fichier-impossible-fichier-corrompu/9a89a23c-b68e-414c-9f20-83ac6b67b493 Quote Affinity Suite 2.5 – Monterey 12.7.5 – MacBookPro 14" 2021 M1 Pro 16Go/1To I apologise for any approximations in my English. It is not my mother tongue.
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.