我想在python中把windows-1252转换为utf-8,我写了这段代码。
def encode(input_file, output_file):
f = open(input_file, "r")
data = f.read()
f.close()
# Convert from Windows-1252 to UTF-8
encoded = data.encode('Windows-1252').decode('utf-8')
with safe_open_w(output_file) as f:
f.write(encoded)
但我有这个错误。
encoded = data.encode('Windows-1252').decode('utf-8')
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xe8 in position 5653: invalid continuation byte
我试图用这个元标签转换一个HTML。
<meta http-equiv="Content-Type" content="text/html; charset=windows-1252">