Pyanepsion Posted August 31 Share Posted August 31 Hello everyone, I'm faced with a situation where an initial text contains 1,279 opening inverted commas (“) and 1,238 closing inverted commas (”). Would you know how to quickly and effectively identify the mismatched inverted commas to solve this problem in the best possible way? Thank you in advance for your expert advice. Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
MikeTO Posted August 31 Share Posted August 31 You should be able to do it with GREP (here's an ID example: https://www.reddit.com/r/indesign/comments/urqld4/grep_for_finding_an_open_quotation_mark_with_no/) but when I'm fixing stuff like this I don't like to think too hard and just do it the brute force way: Copy the text to any other app such as a text editor or MS Word Copy half of it to a new document and use Find to count the opening and closing quotes If they are balanced, close that document because that half is okay If they are unbalanced, copy half of of it to a new document Eventually, you'll have eliminated most of the text which has only balanced quotes and identify the handful of cases with unbalanced quotes. You could probably do this in five minutes. Quote Download a free PDF manual for Affinity Publisher 2.5 Download a quick reference chart for Affinity's Special Characters Affinity 2.5 for macOS Sequoia 15.1, MacBook Pro 14" (M4 Pro) Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Quote Link to comment Share on other sites More sharing options...
Pyanepsion Posted September 1 Author Share Posted September 1 When preparing the DTP for the American translation of a French hit, errors appeared during the conversion of dialogues: in French, lines begin with a dash and end with the paragraph, whereas in English they are enclosed in quotation marks without paragraph constraints. The dichotomous search was not appropriate here, as a balance of errors in one part of the text makes them invisible. For example, a sequence like ‘open, open, close, open, close, close’ would not be detected. The Regex identified two additional errors. I eventually automated the coloring of the quotation marks and the text between them. In the end, it took two days to create this routine and correct the quotation marks. 🟩 It would be useful if the Affinity suite could automatically detect elements of a pair without their counterparts, such as quotation marks, brackets, braces, parentheses, or hyphens. For example, an opening quotation mark should not follow another unclosed opening mark, and a closing quotation mark should not appear after another closing mark. Additionally, the search for the corresponding element could be confined to within the chapter. Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
walt.farrell Posted September 1 Share Posted September 1 5 minutes ago, Pyanepsion said: For example, an opening quotation mark should not follow another unclosed opening mark You can detect that with regular expressions if you know you want to look for it. But while that may be true for quotation marks, it is not generally true for brackets, braces, and parentheses can be properly nested (Open, Open, Close, Close). Quote -- Walt Designer, Photo, and Publisher V1 and V2 at latest retail and beta releases PC: Desktop: Windows 11 Pro 23H2, 64GB memory, AMD Ryzen 9 5900 12-Core @ 3.00 GHz, NVIDIA GeForce RTX 3090 Laptop: Windows 11 Pro 23H2, 32GB memory, Intel Core i7-10750H @ 2.60GHz, Intel UHD Graphics Comet Lake GT2 and NVIDIA GeForce RTX 3070 Laptop GPU. Laptop 2: Windows 11 Pro 24H2, 16GB memory, Snapdragon(R) X Elite - X1E80100 - Qualcomm(R) Oryon(TM) 12 Core CPU 4.01 GHz, Qualcomm(R) Adreno(TM) X1-85 GPU iPad: iPad Pro M1, 12.9": iPadOS 18.1, Apple Pencil 2, Magic Keyboard Mac: 2023 M2 MacBook Air 15", 16GB memory, macOS Sequoia 15.0.1 Link to comment Share on other sites More sharing options...
walt.farrell Posted September 1 Share Posted September 1 Also, in English, quotation marks can be nested which can complicate detection/matching. Open ", open ', open ", close ", close ', close " could be valid for example. That makes it difficult, perhaps impossible, to devise a general scheme that would work for all texts, even without considering the difficulty of different rules for different languages. Old Bruce 1 Quote -- Walt Designer, Photo, and Publisher V1 and V2 at latest retail and beta releases PC: Desktop: Windows 11 Pro 23H2, 64GB memory, AMD Ryzen 9 5900 12-Core @ 3.00 GHz, NVIDIA GeForce RTX 3090 Laptop: Windows 11 Pro 23H2, 32GB memory, Intel Core i7-10750H @ 2.60GHz, Intel UHD Graphics Comet Lake GT2 and NVIDIA GeForce RTX 3070 Laptop GPU. Laptop 2: Windows 11 Pro 24H2, 16GB memory, Snapdragon(R) X Elite - X1E80100 - Qualcomm(R) Oryon(TM) 12 Core CPU 4.01 GHz, Qualcomm(R) Adreno(TM) X1-85 GPU iPad: iPad Pro M1, 12.9": iPadOS 18.1, Apple Pencil 2, Magic Keyboard Mac: 2023 M2 MacBook Air 15", 16GB memory, macOS Sequoia 15.0.1 Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Old Bruce 1 Quote Link to comment Share on other sites More sharing options...
Pyanepsion Posted September 1 Author Share Posted September 1 37 minutes ago, walt.farrell said: That makes it difficult, perhaps impossible, to devise a general scheme that would work for all texts, even without considering the difficulty of different rules for different languages. Yes, although there are many similarities. In French : Parentheses can be closing only in some contexts, as with enumerations, including some title levels. For example: 2) The level 3 title. Semi-cadratine hyphens, used for incisive clauses, work in pairs. However, the second element may be replaced by a full stop (., ?, !), but in practice this hyphen never poses a concordance problem. For example : blablabla – albalbalb – blablabla. blablabla – albalbalb. Braces, quotes (whether computer, English, French, etc.) always work in pairs. It is also important to note that an element in a pair can never be found in another chapter, or even after a page break. Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
MikeTO Posted September 1 Share Posted September 1 1 hour ago, Pyanepsion said: It would be useful if the Affinity suite could automatically detect elements of a pair without their counterparts, such as quotation marks, brackets, braces, parentheses, or hyphens. For example, an opening quotation mark should not follow another unclosed opening mark, and a closing quotation mark should not appear after another closing mark. Additionally, the search for the corresponding element could be confined to within the chapter. Just FYI, in some English books it's okay to have two opening double quotation marks without a closing one in between, as long as they're both at the start of successive paragraphs. An automated feature would need to be a bit smarter about this. walt.farrell 1 Quote Download a free PDF manual for Affinity Publisher 2.5 Download a quick reference chart for Affinity's Special Characters Affinity 2.5 for macOS Sequoia 15.1, MacBook Pro 14" (M4 Pro) Link to comment Share on other sites More sharing options...
Pyanepsion Posted September 1 Author Share Posted September 1 58 minutes ago, lacerto said: These kinds of texts can get absurdly complex to deal with any kind of automation No. You just have to want to do it, and it's of obvious interest to all those who have to process the texts. I haven't gone any further, but you can already see that errors are immediately visually easier to detect, so image a routine that immediately tells you where the errors are. 14 minutes ago, MikeTO said: Just FYI, in some English books it's okay to have two opening double quotation marks without a closing one in between, as long as they're both at the start of successive paragraphs. An automated feature would need to be a bit smarter about this. This also existed in French during the transition period to en dashes. Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Quote Link to comment Share on other sites More sharing options...
Old Bruce Posted September 1 Share Posted September 1 30 minutes ago, lacerto said: These kinds of texts can get absurdly complex to deal with any kind of automation, especially in English language where things can be complicated by ambiguous use (like minute and second glyphs accidentally curled instead of keeping them straight, or simply by contractions like can't etc.), Here is a bit of "The Red Headed League" a Sherlock Holmes story. It has dialog quoted by Dr. Watson from Sherlock, his client Jabez Wilson who is telling Sherlock and Watson what he said and quoting his assistant as well as the head of the Red Headed League. red head league.txt I have coloured the opening double and single quotes and also the closing single and double quotes here in this Publisher document. SH and JW dialog.afpub I think there are a some contractions and possessive apostrophes as well. For this sort of work you need a very good copy editor, a person, not a piece of software. lacerto, PaulEC, R C-R and 1 other 4 Quote Mac Pro (Late 2013) Mac OS 12.7.6 Affinity Designer 2.5.5 | Affinity Photo 2.5.5 | Affinity Publisher 2.5.5 | Beta versions as they appear. I have never mastered color management, period, so I cannot help with that. Link to comment Share on other sites More sharing options...
walt.farrell Posted September 1 Share Posted September 1 55 minutes ago, MikeTO said: in some English books it's okay to have two opening double quotation marks without a closing one in between, as long as they're both at the start of successive paragraphs. I know it's OK in both modern and older US and UK English texts, when the same person is speaking for multiple paragraphs. I'm curious when it's not OK. Quote -- Walt Designer, Photo, and Publisher V1 and V2 at latest retail and beta releases PC: Desktop: Windows 11 Pro 23H2, 64GB memory, AMD Ryzen 9 5900 12-Core @ 3.00 GHz, NVIDIA GeForce RTX 3090 Laptop: Windows 11 Pro 23H2, 32GB memory, Intel Core i7-10750H @ 2.60GHz, Intel UHD Graphics Comet Lake GT2 and NVIDIA GeForce RTX 3070 Laptop GPU. Laptop 2: Windows 11 Pro 24H2, 16GB memory, Snapdragon(R) X Elite - X1E80100 - Qualcomm(R) Oryon(TM) 12 Core CPU 4.01 GHz, Qualcomm(R) Adreno(TM) X1-85 GPU iPad: iPad Pro M1, 12.9": iPadOS 18.1, Apple Pencil 2, Magic Keyboard Mac: 2023 M2 MacBook Air 15", 16GB memory, macOS Sequoia 15.0.1 Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Old Bruce 1 Quote Link to comment Share on other sites More sharing options...
Pyanepsion Posted September 1 Author Share Posted September 1 @lacerto In my humble opinion, it is essential to differentiate between the resolution of a specific case, designed to respond to a particular situation, and that of a company which aims to create sustainable solutions for a wide audience, by generalising the concept. The first question to ask is to precisely define the potential target, and then identify the resources and skills needed to achieve this objective. How many times have I heard that certain tasks were humanly impossible to achieve, only to see those same tasks successfully accomplished? You have, for example, found a solution by placing vowels at the beginning or end of a line, even though, unless I'm mistaken, this is not recommended in Finnish. For my part, I've automated the colouring of delimiters and their content to make it easier to detect missing elements. However, this only reduces the time needed for the review. I think it would be a good idea to continue the analysis to allow the software to report only the isolated or forgotten delimiter. So, yes, I firmly believe that more advanced automation is possible. The main obstacle will be wanting it. As for the market, it seems vast. Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Quote Link to comment Share on other sites More sharing options...
Old Bruce Posted September 1 Share Posted September 1 4 minutes ago, Pyanepsion said: ... I firmly believe that more advanced automation is possible. ... While this has nothing to do with quotes, consider that there are style guides some of which eschew the use of the Oxford comma and others which require its use. And I am sure that we have all had automatic spill chucking fail. There is a point where the automation can cause problems. lacerto 1 Quote Mac Pro (Late 2013) Mac OS 12.7.6 Affinity Designer 2.5.5 | Affinity Photo 2.5.5 | Affinity Publisher 2.5.5 | Beta versions as they appear. I have never mastered color management, period, so I cannot help with that. Link to comment Share on other sites More sharing options...
MikeTO Posted September 1 Share Posted September 1 45 minutes ago, walt.farrell said: I know it's OK in both modern and older US and UK English texts, when the same person is speaking for multiple paragraphs. I'm curious when it's not OK. Just when it's a simple mistake like this: Lorem ipsum dolor sit amet. <Left quote>I forgot to add a closing quotation mark for this quotation. Lorem ipsum blah blah blah. <Left quote>But this time I remembered.<Right quote> walt.farrell 1 Quote Download a free PDF manual for Affinity Publisher 2.5 Download a quick reference chart for Affinity's Special Characters Affinity 2.5 for macOS Sequoia 15.1, MacBook Pro 14" (M4 Pro) Link to comment Share on other sites More sharing options...
lacerto Posted September 1 Share Posted September 1 Obsolete. Quote Link to comment Share on other sites More sharing options...
R C-R Posted September 2 Share Posted September 2 8 hours ago, MikeTO said: Just FYI, in some English books it's okay to have two opening double quotation marks without a closing one in between, as long as they're both at the start of successive paragraphs. FWIW, in some of the novels I have read, if it is the same speaker in successive paragraphs, there is no paragraph closing quotation mark, but if in the next paragraph someone else begins speaking, the preceding paragraph gets the closing quote. It helps prevent confusion about who is saying what. Quote All 3 1.10.8, & all 3 V2.5.6 Mac apps; 2020 iMac 27"; 3.8GHz i7, Radeon Pro 5700, 32GB RAM; macOS 10.15.7 All 3 V2 apps for iPad; 6th Generation iPad 32 GB; Apple Pencil; iPadOS 15.7 Link to comment Share on other sites More sharing options...
Pyanepsion Posted September 2 Author Share Posted September 2 @Old Bruce I looked at your text. In France, we were still writing in this format just over a hundred years ago. It’s obviously a real typographic headache. 🟩 Managing punctuation pairs is a real headache these days, regardless of the language. A verification tool that understands these rules would be extremely useful. ⁂ For those who are interested… The evolution of dialogue typography The typographic evolution of dialogue in France is a subject that has fascinated me. It reflects cultural, linguistic, and technical developments, as well as the ongoing battle between English and French influences. Here’s an overview of the main stages and periods in the typographic evolution of dialogue in the French style (with a bit about the English style, which I’m less familiar with): 1. Medieval period (before the 15th century) Absence of distinctly typographed dialogue: In medieval manuscripts, dialogues were not differentiated from the rest of the text. They were integrated into the body of the text without any specific markings, sometimes using abbreviations or symbols to indicate changes in speakers. 2. Invention of printing (15th century) Birth of modern typography: With Gutenberg's invention of printing around 1450, dialogues began to be printed. However, they were still barely distinguishable from the rest of the text, often inserted without quotation marks or dashes. Dialogues might be indicated by paragraph changes or by capitalized initial letters. 3. 16th-17th centuries Appearance of quotation marks: During the 16th century, quotation marks began to appear in France, initially in the form of double commas at the bottom of a line to enclose quotations or dialogue. However, their use remained inconsistent. Use of dashes: From the 17th century onwards, dashes began to be used to indicate the beginning of lines in dialogue. This practice gradually became more common, particularly in the theater. 4. Eighteenth century Standardization of quotation marks: Quotation marks became increasingly common for framing dialogue, thanks in particular to the influence of the Académie française and Parisian printers. Quotation marks took on a more standardized form, at the top of the line (“…”), to indicate direct speech. More dashes: The use of dashes to introduce lines in dialogue became almost systematic, especially in literary works such as novels and plays. 5. 19th century Affirmation of the modern model: The 19th century saw the spread of the system we know today: dialogues were introduced by dashes or framed by quotation marks. Dashes were mostly used in novels to mark each new line, while quotation marks were used to frame specific sentences or passages. Typographic codification: Typographic practices became standardized thanks to advances in printing technology and efforts by publishing houses to establish precise typographical rules. 6. 20th century Clarification of typographic rules: The 20th century saw the definitive establishment of current standards for dialogue typography in France. French quotation marks (« … ») were preferred over English quotation marks (“…”), although the latter began to appear in certain contexts, particularly in the translation of foreign books. Typographic experiments: With the literary avant-garde (particularly the Nouveau Roman), some authors began to experiment with typography, playing with the presentation of dialogue (absence of dashes, use of italics, etc.) to create particular stylistic or narrative effects. 7. 21st century Diversification of practices: The advent of the typewriter and later computers introduced the use of straight quotation marks ("…"), which became widespread for convenience. Although modern software automatically corrects this issue in many cases, it is not infallible. For example, in the Affinity suite, text formatting in Publisher differs from that in Photo and Designer, where typographical errors linked to keyboard use are not always corrected, leading to inconsistencies. Anglo-Saxon influence: The influence of Anglo-Saxon standards, marked by the use of straight quotation marks and other typographic conventions, is becoming more pronounced among certain authors steeped in American literature. This trend forces French-language typographers to make tedious adjustments to bring texts in line with French conventions, which are simpler and more restrained. ⁂ Integrating a function capable of correctly managing typographical pairs of any kind would be a major asset for the Suite, particularly as it is software closely linked to writing. 📌 The example of programming tools such as Notepad++, Visual Studio Code, and PhpStorm, which are beginning to offer these features as part of the coding process, demonstrates the importance of such improvements. Adding such a feature to the Suite would not only be beneficial but essential to ensuring consistent, professional typography. Old Bruce and lacerto 2 Quote 6 cœurs, 12 processus - Windows 11 pro - 4K - DirectX 12 - Suite universelle Affinity (Affinity Publisher, Affinity Designer, Affinity Photo). ███ Mais je vous le demande, peut-on imaginer une police sans sérifs ? Link to comment Share on other sites More sharing options...
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.
Note: Your post will require moderator approval before it will be visible.