How do I get encoding information?

Problem description

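In an interview, the interviewer asked me this: "When garbled text shows up in a project, how do you find out the original encoding of that data?" (I froze on the spot and didn't know how to answer. Hoping the experts here can teach me.)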

Solution

import java.io.BufferedInputStream;
import java.io.File;
import java.io.FileInputStream;
import java.io.IOException;

public class FileEncodeReferee {
    private final File file;

    public FileEncodeReferee(File file) {
        this.file = file;
    }

    public FileEncodeReferee(String path) {
        file = new File(path);
    }

    /** Guesses the charset from the byte order mark (BOM); falls back to GBK. */
    public String getCharset() {
        String charset = "GBK"; // default when no BOM is found
        byte[] first3Bytes = new byte[3];
        // try-with-resources closes the stream even if reading fails
        try (BufferedInputStream bis = new BufferedInputStream(new FileInputStream(file))) {
            // boolean checked = false;
            bis.mark(3); // lets the disabled scan below reset() to the start
            int read = bis.read(first3Bytes, 0, 3);
            if (read == -1) {
                return charset; // empty file
            }
            if (first3Bytes[0] == (byte) 0xFF && first3Bytes[1] == (byte) 0xFE) {
                charset = "UTF-16LE"; // checked = true;
            } else if (first3Bytes[0] == (byte) 0xFE && first3Bytes[1] == (byte) 0xFF) {
                charset = "UTF-16BE"; // checked = true;
            } else if (first3Bytes[0] == (byte) 0xEF && first3Bytes[1] == (byte) 0xBB
                    && first3Bytes[2] == (byte) 0xBF) {
                charset = "UTF-8"; // checked = true;
            }
            // Disabled heuristic from the original post: scan the content when there is no BOM.
            // bis.reset();
            // if (!checked) {
            //     int loc = 0;
            //     while ((read = bis.read()) != -1) {
            //         loc++;
            //         if (read >= 0xF0) { break; }
            //         // a lone byte in 0x80-0xBF is also treated as GBK
            //         if (0x80 <= read && read <= 0xBF) { break; }
            //         if (0xC0 <= read && read <= 0xDF) {
            //             read = bis.read();
            //             // two-byte sequence (0xC0-0xDF)(0x80-0xBF); could also be valid GB encoding
            //             if (0x80 <= read && read <= 0xBF) { continue; } else { break; }
            //         } else if (0xE0 <= read && read <= 0xEF) {
            //             // could still be wrong, but the probability is low
            //             read = bis.read();
            //             if (0x80 <= read && read <= 0xBF) {
            //                 read = bis.read();
            //                 if (0x80 <= read && read <= 0xBF) { charset = "UTF-8"; break; } else { break; }
            //             } else { break; }
            //         }
            //     }
            //     System.out.println(loc + " " + Integer.toHexString(read));
            // }
        } catch (IOException e) {
            e.printStackTrace();
        }
        return charset;
    }

    public static void main(String[] args) {
        FileEncodeReferee fer = new FileEncodeReferee("E://Huha.csv");
        System.out.println(fer.getCharset());
    }
}
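The BOM check above cannot tell UTF-8 without a BOM from GBK. One possible way to cover that case, offered only as a sketch and not as the poster's method, is to run the bytes through a strict UTF-8 decoder and fall back to GBK when decoding fails. The class name Utf8Validator and its methods are hypothetical, and the GBK fallback is an assumption carried over from the code above. Pure ASCII content passes this check too, since ASCII is valid in both encodings.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of the original post.
final class Utf8Validator {

    /** Returns true if the bytes form a well-formed UTF-8 sequence. */
    static boolean isValidUtf8(byte[] data) {
        // REPORT makes the decoder throw instead of silently inserting U+FFFD.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(data));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    /** Assumption: anything that is not valid UTF-8 is treated as GBK. */
    static String guessCharset(byte[] data) {
        return isValidUtf8(data) ? "UTF-8" : "GBK";
    }

    public static void main(String[] args) {
        byte[] utf8 = "\u4e2d\u6587".getBytes(StandardCharsets.UTF_8);
        byte[] gbkBytes = {(byte) 0xD6, (byte) 0xD0}; // the GBK bytes for U+4E2D
        System.out.println(guessCharset(utf8));     // prints UTF-8
        System.out.println(guessCharset(gbkBytes)); // prints GBK
    }
}
```

This is still a guess: a short GBK text can happen to be valid UTF-8, so the result is more reliable on longer inputs.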
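Solution 2:
If you really don't know, you can simply turn the question around and ask the interviewer. Detecting a file's encoding does seem to be fairly hard. You can open the file in Uedit32 (UltraEdit) and see which encoding matches, or else use a third-party jar to read and analyze the file.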
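The "see which encoding matches" idea can be approximated in code with the JDK alone: try each candidate charset with a strict decoder and keep the ones that decode without error. This is only a rough sketch; the class CandidateCharsets and its method are hypothetical names, and the candidate list is whatever the caller supplies. Real detection libraries use statistics rather than simple validity, and single-byte charsets such as ISO-8859-1 accept any input, so they are poor candidates here.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.Charset;
import java.nio.charset.CodingErrorAction;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the original post.
final class CandidateCharsets {

    /** Returns the candidate names under which the bytes decode without errors. */
    static List<String> plausibleCharsets(byte[] data, String... candidates) {
        List<String> plausible = new ArrayList<>();
        for (String name : candidates) {
            if (!Charset.isSupported(name)) {
                continue; // skip charsets this JVM does not provide
            }
            try {
                Charset.forName(name).newDecoder()
                        .onMalformedInput(CodingErrorAction.REPORT)
                        .onUnmappableCharacter(CodingErrorAction.REPORT)
                        .decode(ByteBuffer.wrap(data));
                plausible.add(name);
            } catch (CharacterCodingException e) {
                // not decodable in this charset; try the next candidate
            }
        }
        return plausible;
    }

    public static void main(String[] args) {
        byte[] sample = {(byte) 0xD6, (byte) 0xD0}; // the GBK bytes for U+4E2D
        System.out.println(plausibleCharsets(sample, "UTF-8", "GBK", "US-ASCII"));
    }
}
```

When more than one candidate survives, a person (or a statistical detector) still has to pick the one whose decoded text actually reads correctly.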
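Solution 3:
Ask the interviewer whether they mean the encoding in effect when the text appears garbled, or the encoding of the version before it became garbled.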
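The distinction matters because garbled text usually comes from bytes written in one encoding and decoded with another. A small sketch of that, assuming the common case of UTF-8 bytes wrongly decoded as ISO-8859-1 (the class MojibakeDemo and its methods are hypothetical names): because ISO-8859-1 maps every byte to exactly one character, the original bytes can be recovered from the garbled string and decoded again with the right charset.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical demo, not part of the original post.
final class MojibakeDemo {

    /** Simulates the bug: UTF-8 bytes decoded with the wrong charset. */
    static String misdecode(String original) {
        byte[] utf8Bytes = original.getBytes(StandardCharsets.UTF_8);
        return new String(utf8Bytes, StandardCharsets.ISO_8859_1);
    }

    /** Undoes it: get the raw bytes back, then decode them as UTF-8. */
    static String recover(String garbled) {
        byte[] rawBytes = garbled.getBytes(StandardCharsets.ISO_8859_1);
        return new String(rawBytes, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        String original = "\u4e2d\u6587";
        String garbled = misdecode(original);
        System.out.println(garbled.equals(original));          // prints false
        System.out.println(recover(garbled).equals(original)); // prints true
    }
}
```

This round trip only works when the wrong charset is lossless. If the bad decoding step replaced bytes with '?' or U+FFFD, the original bytes are gone and cannot be recovered this way.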

Time: 2024-09-14 23:38:39
