creating a linguistic corpus દોર પોસ્ટ કરનાર: Sandro Ruggeri Dulcet
|
We are a team engaged in creating a linguistic corpus from English, Portuguese and Spanish texts. We can also envisage some other expected outcomes like related to those texts glossaries on topics specific to the Amazonian Region (Environment, Culture, Politics).
Does anyone have a background or a previous experience in that kind of projects that could be useful to us?
Thanks in advanced
Sandro Ruggeri Dulcet | | | Steven Capsuto યૂનાઇટેડ સ્ટેટસ્ Local time: 00:26 સભ્ય (2004) સ્પેનીશ થી અંગ્રેજી + ... Out of curiosity | Oct 5, 2007 |
Sandro Ruggeri wrote:
We are a team engaged in creating a linguistic corpus from English, Portuguese and Spanish texts.
Are you creating a parallel corpus, using translations of the same text in three languages? Or are they separate texts in each language? | | | working (ideally) with parallel texts | Oct 5, 2007 |
Dear Steven since we want this to become a useful tool for local and translators worldwide, we intend to create the corpus from parallel texts. This may also be a way to overcome some gaps or even very commonly spread misunderstanding in terminology regarding biological subjects for instance. However, we must before see what kinds of texts we will be able to put together to work with. | | | Lia Fail (X) સ્પેન Local time: 06:26 સ્પેનીશ થી અંગ્રેજી + ... I posted a reply, where is it????? | Oct 7, 2007 |
Sandro Ruggeri wrote:
Dear Steven since we want this to become a useful tool for local and translators worldwide, we intend to create the corpus from parallel texts. This may also be a way to overcome some gaps or even very commonly spread misunderstanding in terminology regarding biological subjects for instance. However, we must before see what kinds of texts we will be able to put together to work with.
I posted a reply, where is it????? I referred to bi/trilingual and comparable corpora.
I mentioned what Steven mentioned, is this parallel? That, is this translated texts?
Comparable, by the way, looks at texts produced in the same genre, domain etc (they are not translations, but they are similar). EG, a newspaper article in a Brazilian newspaper describing what SARS is and a similar (NOT TRANSLATED article) describing SARS in a GB (or USA) paper.
I am involved in corpus projects (we give a workshop and we research tools, and I think, we are pretty much up there when it comes to tools for corpora for specialist translators to EN). SEE http://www.metmeetings.org/pagines/METM07/Workshops/Workshop0.htm
I personally have no interest in parallel corpora, as translated text is not good guidance for translators (after all, that's what a translation memory is). It's of interest to academics who want to see what "happens" in translations. And if it's a question of terminology, well, that's another issue, and it doesn't need "corpora", it needs glossaries and termbases and such.
My perspective - and obsession -- is to "write the language of domain", untainted by the source language, the idea being to minimise the pernicious effects of the source structures ...move away from the all-engrossing effects of the source and focus instead on the target as a model of good writing.
Which is why I only want corpora in the target language
My 2 cents:-)
[Edited at 2007-10-07 22:56]
[Edited at 2007-10-07 22:57] | |
|
|
For a better understanding... | Oct 8, 2007 |
Hi Lia (I found you in my Spanish and here you are again my English thread post!)
Sorry, there is something I miss in your reply. However, I can intuitevely perceive that it can turn to be extremely important for our project. Could you explain in detail what you mean when you say "My perspective...is to write the language of domain"...etc?
Cheers
Sandro | | | કોઇ મધ્યસ્થી સ્પષ્ટ રીતે આ મંચ માટે નિમવામાં આવ્યો નથી. સાઇટના નિયમોના ઉલ્લંઘનોની જાણ કરવા અથવા મદદ મેળવવા, કૃપા કરી સાઇટ સ્ટાફ » નો સંપર્ક કરો creating a linguistic corpus Wordfast Pro |
---|
Translation Memory Software for Any Platform
Exclusive discount for ProZ.com users!
Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value
Buy now! » |
| Protemos translation business management system |
---|
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!
The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.
More info » |
|
| | | | X Sign in to your ProZ.com account... | | | | | |