Ticket #28 (closed defect: worksforme)
Encoding problems when converting svn -> darcs (or bzr).
|Reported by:||Luca <luca@…>||Owned by:||lele|
|Keywords:||svn svndump non ascii error í||Cc:|
I'm having some problems with the encoding of the character í ('i' with acute accent, í as HTML entity). When converting a svn repository to darcs I get this message:
00:37:07 [I] Changeset "43" 00:37:07 [I] Log message: - Nuevo nivel de logging CRITICAL (L_CRI) para concordar con python. - Mínimo cambio en el formato de logging. - Cambio de sección de configuración de DB_DataObject a DBO. 00:37:07 [I] 110 pending changesets in state file 00:37:07 [C] Upstream change application failed Configuration error: 'ascii' codec can't encode character u'\xed' in position 216: ordinal not in range(128): it seems that current encoding "UTF-8" cannot properly represent at least one of the characters in the upstream changelog. You need to use a wider character set, using "encoding" option.
My locale is UTF-8, but I even used the encoding option with no results. The weir thing is other non-ascii characters seems to work fine (á, é, ó, ú). When I use the svndump as the source, I've got no errors, but 'í' characters are not encoded properly:
Fri Feb 4 12:19:47 ART 2005 luca * - Nuevo nivel de logging CRITICAL (L_CRI) para concordar con python. - MÃ\adnimo cambio en el formato de logging. - Cambio de sección de configuración de DB_DataObject a DBO.
As you can see, ó in configuración is just fine, but í in Mínimo is encoded as MÃ\adnimo, which is wrong.
It's easy to reproduce the problem:
cd /tmp svnadmin create testrepo svn co file:///tmp/testrepo testwc touch testwc/test svn add testwc/test svn ci -m 'í' testwc
Now you can tailor this repository to convert it to darcs with svn as repo and you'll get the error, or 'svnadmin dump'it and use svndump as repo to get the wrong encoding.
- Subversion: 1.2.3 (r15833)
- Darcs: 1.0.4
- Tailor: 0.9.19
- Summary changed from Encoding problems when converting svn -> darcs to Encoding problems when converting svn -> darcs (or bzr).