(We’re going to need an AI community lol instead of posting to genzedong all the time)

Today I want to share how I use Deepseek to translate text.

I was given a friend’s game to translate into various languages. These usually come as json files or similar, structured in a specific way with key:value pairs. This makes it easier for devs and translators to handle multiple languages.

It’s also a file structure LLMs understand very well.

The file can look like this:

"tx_b_menu"			: "Menu",
"tx_b_newgame"			: "New Game",
"tx_b_continue"			: "Continue",
"tx_b_options"			: "Options",

etc

If it’s properly set up, like this one, then that’s additional context the LLM can use to understand what it’s translating by reading the key property.

Now mind you, this file has only 650 lines in it - it’s a small indie game. This is something deepseek can handle in one go without needing to break it up into multiple tasks. My upper limit so far has been sending it 1060 lines of JS, so it can take in a decent amount of context.

These strings can also contain variables such as [%v], which will be replaced by numbers or words in-game. They can also contain other markup such as [color=yellow]text[/c] to indicate the text should display as yellow.
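For illustration, entries with these markers could look something like this (made-up examples, not actual strings from the game):

"tx_hp_label"			: "HP: [%v]",
"tx_warn_poison"		: "[color=yellow]You are poisoned![/c]",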

I made a huge prompt for deepseek just to properly frame the task, but thanks to its reasoning capabilities it understood the structure just fine on the first try, including that it should leave variables and other markup alone.

To complete the translation, I sent deepseek the json file instead of pasting it (it can read text files but not pictures), and I actually sent two of them: one in English, the other in French. Both were human-made and so should be consistent with each other. That way, deepseek can (hopefully) cross-reference the two files to eliminate ambiguity if it’s not sure about a string. I once saw “Options” translated as “Choices” in a game, so.
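Before sending anything, it’s worth checking that the two source files actually line up. A minimal sketch, assuming each file is a single flat JSON object and using the file names from the prompt below (local_en.json / local_fr.json):

import json

# Load both human-made localization files (assuming each is one JSON object)
with open("local_en.json", encoding="utf-8") as f:
    en = json.load(f)
with open("local_fr.json", encoding="utf-8") as f:
    fr = json.load(f)

# Any key present in one file but not the other is an inconsistency
# the LLM would otherwise have to guess around.
print("Missing from local_fr:", sorted(en.keys() - fr.keys()) or "none")
print("Missing from local_en:", sorted(fr.keys() - en.keys()) or "none")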

In my prompt, I explained:

  • that I was sending two json files, and which languages they were in (and which one is which);
  • what the json file is and why games do this;
  • that it was a professional translator who knows both languages perfectly AND has experience working with video games and devs;
  • that the task is to translate to [language] while leaving the json alone, i.e. there are key:value pairs and it can only touch the value portion, not the key;
  • that it should retain the flair of the original strings, meaning don’t add or change stuff that’s not there (very important);
  • that the game is a ‘fantasy’ setting, so it knows what kind of words it’s looking for;
  • that it should read the files carefully before doing any translation;
  • how to handle special characters like \n so as not to break the UI;
  • a reminder to translate to [language] and to output the translated content.
Full prompt

I am sending you TWO json files for a video game called [game]. One of the files (local_en) contains the English strings for the game. The other, local_fr, contains a human translation of the same strings into French. As you know, video games often handle languages this way for translation purposes, loading the strings from an external json file so that it’s easier for translators and devs to translate into a variety of languages. That’s where you come in. You are a professional translator who knows English and French perfectly, and you especially have experience working on video games and with video game developers.

Your task is to translate the strings to [language] while leaving the JSON alone. This means that for each key:value pair, you will only translate what comes after the colon, i.e. the value, but not its property name.

When it comes to how to translate strings, there are of course various ways to approach it as a professional. You should as much as possible retain the flair of the original files, especially as it’s for a fantasy-type game. So read through the file carefully before starting the translation task and, above all, don’t add things that aren’t there in the original. Be direct: translate as closely to the original as possible so that strings remain consistent inside the game.

When it comes to \n special characters, in other words line breaks, you should check the length in the original first and decide where to place the \n in the [language] translation so as not to break the UI when the text is later loaded in the game. Likewise, translated strings shouldn’t be longer than the original if possible - visual space is a constraint here. Some terms come back often in various ways and should be kept consistent each time.

Remember, it has to be translated to [language]. Output the translated content and I will copy it manually from your output. Take a deep breath, don’t worry, and let’s do it!

It took around 25 seconds to think it through, catching things I hadn’t necessarily considered and planning how it would approach the task. Then it just generated a complete json file.

This was all done through the web interface. Because the file is so small in the first place, I don’t need the API which you have to pay for. Too much of a hassle.

It does take a while to output the translated strings, but that’s okay. I just play another game while it works and check back on it later.

The translated strings come back in perfectly valid JSON and I can even click to download the file. Then I just need to rename it, and I can test it in the game.
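Before dropping the renamed file into the game, a quick script can catch the mechanical problems: missing keys, lost [%v] variables or [color=...] markup, mismatched line breaks, and strings that grew much longer than the original. A rough sketch; the file names and the 1.3x length threshold are my own assumptions:

import json
import re

# Assumed file names: the English source and the downloaded translation,
# renamed for the target language. Assumes a flat file of key:string pairs.
with open("local_en.json", encoding="utf-8") as f:
    original = json.load(f)
with open("local_xx.json", encoding="utf-8") as f:
    translated = json.load(f)

# 1. Only values should have changed - the key sets must be identical.
if original.keys() != translated.keys():
    print("Key mismatch between the two files!")

# Matches anything in square brackets: [%v], [color=yellow], [/c], etc.
marker = re.compile(r"\[[^\]]*\]")

for key, src in original.items():
    out = translated.get(key, "")
    # 2. Variables and markup tags must survive the translation untouched.
    if sorted(marker.findall(src)) != sorted(marker.findall(out)):
        print(f"{key}: markers changed")
    # 3. Line break counts should match so the UI layout isn't thrown off.
    if src.count("\n") != out.count("\n"):
        print(f"{key}: line break count differs")
    # 4. Flag strings that grew noticeably longer than the original.
    if len(out) > 1.3 * len(src):
        print(f"{key}: translation much longer than original")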

With that, you can translate stuff very easily and make it accessible more broadly. There are hundreds of theory essays and books that only exist in one language. I’ve already used older LLMs for book translation tasks. By properly testing your prompt first, you can get consistent results on those longer texts too.

Some caveats:

  • I stick to HRLs (high-resource languages), since there is sufficient training data for the LLM; it will hallucinate in lower-resource languages.
  • Make a translation to a language you can read first so you know how it handles it, and refine your prompt afterwards. Keep doing small batch tests like this (a few strings at a time) until you’re satisfied - see the sketch after this list for one way to slice off a test batch. The prompt I shared above was created after years of doing translation tasks with LLMs; I (mostly) know what to tell them by now.
  • Also confirm your translations in languages you don’t understand: run some strings through Google Translate, ask someone who speaks the language if they can take a quick look, google the terms. For example, I took the word it gave me for “fireball” in Japanese and looked online to confirm it was used in other contexts (I found it on magic cards lol).
  • Is it perfect? Probably not. But the original translations were done by amateurs too (e.g. me for French), because the dev, like many people, has no money to pay a professional for everything.
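For the small batch tests mentioned above, something like this slices off the first few strings into a separate test file (a sketch; the batch size and file names are arbitrary):

import json
from itertools import islice

with open("local_en.json", encoding="utf-8") as f:
    en = json.load(f)

# Take the first 20 key:value pairs as a small test batch for prompt iteration.
batch = dict(islice(en.items(), 20))

with open("local_en_testbatch.json", "w", encoding="utf-8") as f:
    json.dump(batch, f, ensure_ascii=False, indent=4)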

But the good part is that it doesn’t destroy the original, right? A human can always come along and do a perfect human translation, or you can always redo the translations later with better models. It’s not destructive.

Hope this helps you out. If there are theory books that only exist in your language, I can only recommend making them accessible. We’d be happy to host them on ProleWiki if you don’t know where to disseminate them. (Spoiler: usually you go through the trouble and then find a super obscure translated edition from 50 years ago as soon as you finish lol)

-> late edit: since the game is not compiled (like a ton of indie games), it’s also possible for users to add their own language if it’s missing and they want it. I expect this use case will become bigger in the future: being able to customize your software and tailor it to your needs.

You can, for example, already find models that will generate subtitle files from a video (https://freesubtitles.ai/ is one I’ve used a few times, it’s free lol). If a series you’d like to watch is not available in your language, then you can have subtitles generated for it and enjoy it today.

  • KrasnaiaZvezda@lemmygrad.ml · 1 month ago

    (We’re going to need an AI community lol instead of posting to genzedong all the time)

    I had made c/Singularity for things like this, but most of the talk about AI/LLMs is on c/technology since it’s bigger.

    then that’s additional context the LLM can use to understand what it’s translating by reading the key property.

    I was just thinking that it would be easy to remove the keys for reduced tokens but treating it as extra context for the LLM makes a lot of sense.

    Nice job!

    And two questions: Do you ask the LLM after it’s done if there were mistakes or if there is anything that can be improved as well? And as for books and longer texts, do you have to break them up or do you keep to things that can be done in one go?

    • CriticalResist8@lemmygrad.mlOP · 1 month ago

      For books I break them up, but deepseek seems to be able to handle a huge amount of tokens. If it can’t handle it anymore (if the convo gets too long), it will return a server error, so I just copy my initial prompt and a long portion of text and start over in a new chat.

      Just to make sure, I tell it in my initial ‘framing’ prompt that I’m going to be sending excerpts sequentially and that it should only return the translation and nothing else.
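      A rough sketch of how the splitting could be scripted, assuming a plain-text book (the chunk size is an arbitrary guess, not a deepseek limit):

      # Split a plain-text book into excerpts small enough to paste one at a time.
      CHUNK_CHARS = 8000  # arbitrary; shrink it if the chat starts erroring out

      with open("book.txt", encoding="utf-8") as f:
          paragraphs = f.read().split("\n\n")

      chunks, current = [], ""
      for para in paragraphs:
          # Start a new excerpt rather than cutting a paragraph in the middle.
          if current and len(current) + len(para) > CHUNK_CHARS:
              chunks.append(current)
              current = ""
          current += para + "\n\n"
      if current:
          chunks.append(current)

      for i, chunk in enumerate(chunks, start=1):
          with open(f"excerpt_{i:03d}.txt", "w", encoding="utf-8") as f:
              f.write(chunk)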

      For mistakes etc you could probably ask it to do a second pass. You might even want to try a new, fresh chat so that it doesn’t know what the original was. That’s a good idea that I hadn’t thought about!

      I was just thinking that it would be easy to remove the keys for reduced tokens but treating it as extra context for the LLM makes a lot of sense.

      And it saves on effort too if you just send it the full file x)

      • KrasnaiaZvezda@lemmygrad.ml · 1 month ago

        lol

        For mistakes etc you could probably ask it to do a second pass. You might even want to try a new, fresh chat so that it doesn’t know what the original was. That’s a good idea that I hadn’t thought about!

        In some tests I was doing using Qwen 0.6B LLMs for classification, I did ask it multiple times and basically gave more weight to whatever appeared across more tries. In your case you could probably ask two different models and take anything translated the same way both times as “good enough”, then use an(other) LLM to check the remaining things, although the longer the sentence/text/key, the less such a system is likely to help and the more the raw LLM abilities will be necessary.

        And as for asking the LLMs for mistakes, I was curious because big LLMs should be able to catch some mistakes due to reflection…

        • CriticalResist8@lemmygrad.mlOP · 1 month ago

          I tried your proofread method with another file and I think there’s definitely some merit to it, to make sure that specific terms get translated the same way each time and to improve consistency. I just asked deepseek to do a second pass and look for consistency, typos, errors, etc. It didn’t seem to have a lot to correct though.