https://www.proz.com/forum/sdl_trados_support/20999-segmentation_rules_for_references.html

Segmentation rules for references?
Thread poster: OTMed (X)
OTMed (X)
Poland
Local time: 12:23
English to Polish
May 7, 2004

Hi

We are working on a series of documents containing quite a number of references in the following format:
{Sentence1}.{Ref. No in superscript}{space} {Sentence2},
e.g. quote
This effect has been described in a number of studies.11,14 There is however no available data to support carcinogenic effect.
end quote

The Trados built-in ‘full stop’ segmentation rule does not apply here, as there is no trailing space following the dot.
I have therefore been trying to create a new rule with:
0 leading chars
0 trailing whitespaces
1 leading words
’.’ stop character

However, for some unclear reason this rule is not being applied. What is more, WB will not save this set of options: the next time I open the Segmentation rules tab, 1 is inserted in the Trailing whitespaces box.
Could anyone assist? Your help will be appreciated.


 
tectranslate ITS GmbH
Local time: 12:23
German
Preprocessing May 7, 2004

In a case like this, I would tend towards developing a preprocessing procedure for the files rather than altering the segmentation rules, which can cause all sorts of problems later on.
Without looking at your actual file it is hard to make conclusive suggestions, but you might want to look into the possibility of marking up the ".11,14" bits as tw4WinInternal (or tw4WinExternal).
Another possibility might be to add paragraph marks ("Return"s) before and after these through a find-and-replace operation, and to remove them again once the translation is finished.

Sorry I can't be of more help at this point.

I still hope this helps somehow.

Regards,
Benjamin
tectranslate, Germany
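
For illustration only, the find-and-replace idea from the reply above could be sketched on plain text roughly like this. It is not a Trados feature and not the poster's actual procedure: the function names, the pilcrow marker and the regular expression are all assumptions, and the pattern assumes English source text in which the next sentence starts with an uppercase letter. It adds a break only after the reference numbers, which is enough to end the segment there.

```python
import re

# Hypothetical marker for the temporary break; any string that never
# occurs in the source text would do.
MARKER = "\u00b6"

# A full stop not preceded by a digit (so decimals like "3.5" are skipped),
# then reference numbers such as "11", "11,14" or "3-5", then the blank(s)
# before an uppercase letter starting the next sentence.
REF_AFTER_STOP = re.compile(r"(?<!\d)(\.\d+(?:[,\u2013-]\d+)*)[ \t]+(?=[A-Z])")


def add_breaks(text: str) -> str:
    """Insert a marked line break after each '.<reference numbers>' group."""
    return REF_AFTER_STOP.sub(lambda m: m.group(1) + MARKER + "\n", text)


def remove_breaks(text: str) -> str:
    """Undo add_breaks once the translation is finished."""
    return text.replace(MARKER + "\n", " ")


if __name__ == "__main__":
    sample = (
        "This effect has been described in a number of studies.11,14 "
        "There is however no available data to support carcinogenic effect."
    )
    broken = add_breaks(sample)
    print(broken)
    print(remove_breaks(broken) == sample)
```

Note that remove_breaks always restores a single space, so runs of several blanks after a reference would be collapsed. In a real Word file the same idea would be applied with Word's own find-and-replace (or a macro) rather than on plain text.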


 

