|
4 months ago | |
---|---|---|
.gitignore | 1 year ago | |
README.md | 5 months ago | |
index.js | 4 months ago | |
package.json | 4 months ago | |
test.js | 4 months ago | |
yarn.lock | 4 months ago |
Scrape data from DeepL translator without applying for the paid authenticated API, using Puppeteer, Chrome browser's headless API.
From npm
yarn add deepl-scraper
or
npm i deepl-scraper --save
puppeteer
on Linuxsudo apt install libnss3-dev libxss1
sudo sysctl -w kernel.unprivileged_userns_clone=1
const { translate, getSupportedLanguages, quit } = require('deepl-scraper');
translate(sentence, source, target).then(console.log).catch(console.error);
sentence
string - Word/sentence to be translatedsource
string (optional) - Word/sentence original language
auto
target
string - Language for word/sentence to be translatedThe module doesn't store languages and will always support DeepL's languages list without required update.
IETF tags are used as language format.
Get arrays of supported source and target languages using the following code :
getSupportedLanguages().then(console.log);
{
"sourceLanguages": [
"en",
"de",
"fr",
"es",
"pt",
"pt",
"it",
"nl",
"pl",
"ru",
"ja",
"zh"
],
"targetLanguages": [
"en-US",
"en-GB",
"de-DE",
"fr-FR",
"es-ES",
"pt-BR",
"it-IT",
"nl-NL",
"pl-PL",
"ru-RU",
"ja-JA",
"zh-ZH"
]
}
Since this module uses a headless browser, it won't quit as long as your main script is ronning or until you quit it using the following code :
quit()
INVALID_SOURCE_LANGUAGE
INVALID_TARGET_LANGUAGE
UNSUPPORTED_SOURCE_LANGUAGE
UNSUPPORTED_TARGET_LANGUAGE
translate('hello', 'en', 'fr-FR').then(console.log);
{
"source": {
"lang": "en",
"sentence": "hello"
},
"target": {
"lang": "fr",
"sentences": [
"Bonjour,",
"Bonjour",
"bonjour",
"hello",
"h"
],
"translation": "hello"
}
}
(a single word's language is usually more difficult to detect)
// Either
translate('hello', 'auto', 'fr-FR').then(console.log);
// Or
translate('hello', undefined, 'fr-FR').then(console.log);
{
"source": {
"lang": "en",
"confident": false,
"sentence": "hello"
},
"target": {
"lang": "fr",
"sentences": [
"Bonjour,",
"Bonjour",
"bonjour",
"hello",
"h"
],
"translation": "hello"
}
}
(a sentence's language is usually more efficient)
translate('hey, what\'s up ?', 'auto', 'fr-FR').then(console.log);
{
"source": {
"lang": "en",
"confident": false,
"sentence": "hey, what's up ?"
},
"target": {
"lang": "fr",
"sentences": [
"hey, quoi de neuf ?",
"Hé, qu'est-ce qu'il y a ?",
"h",
"Hé, quoi de neuf ?",
"Hé, qu'est-ce qu'il y a ?"
],
"translation": "Hé, qu'est-ce qu'il y a ?"
}
}
Why using Puppeteer instead of HTTP requests ?
Fucking rate limit error always showing up, whatever you do.
1.0.0
(2019-05-06) • Initial release1.0.x
(2019-05-11)
1.0.1
README.md
1.0.2
1.0.3
1.0.4
1.0.5
(2019-06-27)
getSupportedLanguages
method1.0.6
(2019-07-05)
1.0.7
(2019-07-05)
1.0.9
(2020-07-10)
quit
parameter from translate
method to new quit
method1.0.10
(2020-08-21) • Actually remove jsdom NPM dependency1.0.11
(2020-08-24)
UNSUPPORTED_SOURCE_LANGUAGE
& UNSUPPORTED_TARGET_LANGUAGE
errors