summaryrefslogtreecommitdiff
path: root/doc/bugs/forget_corrupts_non-ascii_chars.mdwn
blob: f4506c2284a8427ca937e00b5161a204a6cae64b (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
### Please describe the problem.

it seems that `git annex forget --drop-dead --force` will double-encode UTF-8 characters.

### What steps will reproduce the problem?

Have a remote with an accent in the description.

### What version of git-annex are you using? On what operating system?

5.20140412 on debian jessie amd64.

### Please provide any additional information below.

[[!format sh """
anarcat@marcos:video$ git annex info
repository mode: indirect
trusted repositories: 1
        5adbab10-0f7a-467b-b0d8-5d7af2223103 -- here (backup (anarcat@marcos:/srv/video))
semitrusted repositories: 5
        00000000-0000-0000-0000-000000000001 -- web
        2d594c02-9164-4ebd-ac47-f8cdc9c16e2f -- rachel@topcrapn:~/Vidéos/anarcat
        2f90b958-95e4-44e3-8d3b-e780b63936d1 -- portable (300GB drive labeled VHS)
        96365189-a0b9-4c33-865f-b573d98572dc -- trouneslo@ifuckinglovecumputers:~/video-anarcat
        ae3d62e6-49be-4340-ba25-c8736a1637c4 -- anarcat@angela:~/video
untrusted repositories: 1
        f6a70cfe-0ee4-4762-9f1d-b7b545fbd856 -- External hard drive labeled VHS
transfers in progress: none
available local disk space: 14.29 gigabytes (+10 gigabytes reserved)
local annex keys: 4110
local annex size: 991.85 gigabytes (+ 2 unknown size)
annexed files in working tree: 4108
size of annexed files in working tree: 1.01 terabytes (+ 2 unknown size)
bloom filter size: 16 mebibytes (0.8% full)
backend usage:
        SHA256E: 8214
        URL: 4
anarcat@marcos:video$ git annex dead f6a70cfe-0ee4-4762-9f1d-b7b545fbd856
dead f6a70cfe-0ee4-4762-9f1d-b7b545fbd856 ok
(Recording state in git...)
anarcat@marcos:video$ git annex unused
unused . (checking for unused data...) (checking HEAD...) (checking synced/master...) (checking backup/master...) (checking portable/annex/direct/master...) (checking portable/master...) (checking portable/synced/master...) (checking steph/master...) (checking steph/synced/master...) (checking video/synced/master...) ok
anarcat@marcos:video$ git annex forget --drop-dead --force
forget git-annex (Recording state in git...)
ok
(Recording state in git...)
anarcat@marcos:video$ git annex info
repository mode: indirect
trusted repositories: (merging synced/git-annex backup/git-annex backup/synced/git-annex portable/git-annex portable/synced/git-annex steph/git-annex video/synced/git-annex into git-annex...)
(Recording state in git...)
(Recording state in git...)
1
        5adbab10-0f7a-467b-b0d8-5d7af2223103 -- here (backup (anarcat@marcos:/srv/video))
semitrusted repositories: 5
        00000000-0000-0000-0000-000000000001 -- web
        2d594c02-9164-4ebd-ac47-f8cdc9c16e2f -- rachel@topcrapn:~/Vidéos/anarcat
        2f90b958-95e4-44e3-8d3b-e780b63936d1 -- portable (300GB drive labeled VHS)
        96365189-a0b9-4c33-865f-b573d98572dc -- trouneslo@ifuckinglovecumputers:~/video-anarcat
        ae3d62e6-49be-4340-ba25-c8736a1637c4 -- anarcat@angela:~/video
untrusted repositories: 0
transfers in progress: none
available local disk space: 14.25 gigabytes (+10 gigabytes reserved)
local annex keys: 4110
local annex size: 991.85 gigabytes (+ 2 unknown size)
annexed files in working tree: 4108
size of annexed files in working tree: 1.01 terabytes (+ 2 unknown size)
bloom filter size: 16 mebibytes (0.8% full)
backend usage:
        SHA256E: 8214
        URL: 4
# End of transcript or log.
"""]]

well that's interesting - the above paste is broken by ikiwiki as well... in the text area where i paste it, "rachel@topcrapn:~/Vidéos/anarcat" shows up as "rachel@topcrapn:~/Vidéos/anarcat" but when i preview, the character gets corrupted. and obviously, the second instance then gets *double* corrupted - wheepee. the original paste has "rachel@topcrapn:~/Vidéos/anarcat". --[[anarcat]]

> [[fixed|done]]; writes to git-annex branch now preserve the original
> encoding, and I've tested that lots of interesting unicode is preserved
> across a forget run. --[[Joey]]