blob: 50d5c512a01a4538291e01a0d3c47c4a6bde9068 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
|
* find a better way to run "find" without creating the index storage, if one
if one is not already readily available (understand cost now, improve after merge)
* improve performance of splitlines (CF)
* think about cost of utf8 list strategy (CF)
* revisit why runicode import str_decode_utf_8_impl needed instead of runicode
import str_decode_utf_8
* revisit remaining places in win32 where we do utf8.decode('utf-8'), they should work
directly with utf8 (can be converted via runicode.str_decode_utf_8 as well)
- rutf8.utf8_encode_mbcs
- unicodehelper.fsencode
- _winreg.interp_winreg
* remove 'assert not isinstance(*, unicode)
* add a flag that prevents support for unicode in rpython and enable it in PyPy (CF, Armin)
* convert all realunicode_w to unicode_w after we flush out all old uses of
unicode_w
* review all uses of W_Unicode.text_w, right now it is exactly W_Unicode.utf8_w.
It shoud only return valid utf8 (see 0be26dc39a59 which broke translation on
win32 and failed tests on linux64). Then we can use it in places like
_socket.interp_func.getaddrinfo instead of space.encode_unicode_object(w_port,
'utf-8', 'strict')
|