javascript - Auto-Detect character encoding in Node.js -
i looking method accurately detect character encoding of srt subtitle files, given (user-selected) language. using jschardet library detection, unfortunately gets wrong often.
var fs = require("fs"); var jschardet = require("jschardet"); var iconv = require('iconv-lite'); var loadfile = function(path, langcode){ var content = fs.readfilesync(path); var encoding = jschardet.detect(content).encoding.tolowercase(); return iconv.decode(content, encoding); }; (added in case there's wrong code)
is there more accurate jschardet? perhaps receives language argument? guess info improve accuracy lot, i'm not familiar libraries receive language argument.
Comments
Post a Comment