Set the minimum repetition count from 3 to 2 or make it configurable

Tekl's Avatar

Tekl

29 Jan, 2014 03:20 PM

I like the repetition count in Marked, but it's useless for me as it only finds three or more repetitions. I also need marks for two repetitions to see if words are used twice in a paragraph.

Example

We will include our comments once we have gathered all the relevant information. We will send the final report as soon as it is finished.

Further ideas

It would also be nice to have a third mode in addition to document or paragraph. Repetitions are most annoying when they are near by. So a near-by-mode would be useful. Think of a repetition within the prepending and following 100 words.

I hope you understand my idea.

With kind regards,

Tekl

  1. Support Staff 1 Posted by Brett on 29 Jan, 2014 03:55 PM

    Brett's Avatar

    I'm willing to make it configurable between 2 and 4 or 5. That's not difficult.

    "Nearby" repetition would be a lot more intensive for processing as it would mean constant buffering and overlapping of strings. I'll test implementing it, but paragraph scope already takes over a minute on medium size documents, it might just be too slow to be practical.

  2. 2 Posted by Tekl on 29 Jan, 2014 04:39 PM

    Tekl's Avatar

    Thanks for improving this. It's still buggy.

    Try this text and add the Word "Maecenas" behind "Aliquam". Nothing will be highlight in this paragraph anymore. Also the word "consectetur" will never be highlighted.

    ## Wordrepetition test

    Lorem ipsum dolor sit amet, consectetur adipiscing elit. Proin tincidunt auctor nibh id malesuada. Nunc et urna nisi, id mattis nulla. Ut rutrum sapien augue, vel ultricies nisl. Maecenas tincidunt volutpat tincidunt. Aliquam erat volutpat. In gravida ultricies dolor vitae malesuada. Ut hendrerit hendrerit risus a feugiat. Maecenas ullamcorper arcu nec tellus blandit nec pharetra nisi fermentum. Vivamus nisi mi, pulvinar non blandit quis, pellentesque at lectus. Pellentesque habitant morbi tristique senectus et netus et malesuada fames ac turpis egestas.

    Praesent et elit velit, eu consectetur nibh. Praesent et elit velit, eu consectetur nibh.

  3. Support Staff 3 Posted by Brett on 29 Jan, 2014 05:06 PM

    Brett's Avatar

    Ok, so the Maecenas Aliquam issue is because I had it set to ignore proper names, which that was recognized as. I removed that restriction for now, will test further.

    "consectetur" is recognized by NSLinguisticTagger as a preposition, which I'm definitely ignoring. In normal English usage, this isn't going to be an issue. It would be horrible if every "the" in the text was a repeat.

  4. 4 Posted by Tekl on 30 Jan, 2014 10:01 AM

    Tekl's Avatar

    Ah, thanks for clarification. Do you have plans to use NSLinguisticTagger like in Writer Pro?

Reply to this discussion

Internal reply

Formatting help / Preview (switch to plain text) No formatting (switch to Markdown)

Attaching KB article:

»

Attached Files

You can attach files up to 10MB

If you don't have an account yet, we need to confirm you're human and not a machine trying to post spam.

Keyboard shortcuts

Generic

? Show this help
ESC Blurs the current field

Comment Form

r Focus the comment reply box
^ + ↩ Submit the comment

You can use Command ⌘ instead of Control ^ on Mac