Today, while processing text data, I ran into fields separated by what looked like a space or a tab, but regular expressions for spaces and tabs did not match it. I asked a colleague and learned that the separator can be matched with "\\uf8f5", that is, the character U+F8F5.
Text to process:
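To illustrate why the whitespace patterns fail, here is a minimal sketch. The class name, the helper method and the sample string are made up for this example; the only thing taken from the post is the separator U+F8F5, which lies in Unicode's Private Use Area and is therefore not whitespace as far as the regex engine is concerned.

```java
import java.util.regex.Pattern;

public class SeparatorMatchDemo {
    // Hypothetical sample: two fields joined by U+F8F5, with no real whitespace.
    static final String SAMPLE = "A-bomb\uf8f5n.";

    /** Returns true if the regex finds a match anywhere in the text. */
    static boolean matches(String regex, String text) {
        return Pattern.compile(regex).matcher(text).find();
    }

    public static void main(String[] args) {
        // Whitespace classes do not cover a Private Use Area character.
        System.out.println("\\s matches: " + matches("\\s", SAMPLE));
        System.out.println("\\t matches: " + matches("\\t", SAMPLE));
        // The regex escape for the exact code point does match.
        System.out.println("\\uf8f5 matches: " + matches("\\uf8f5", SAMPLE));
    }
}
```

Only the last check should report a match, which is consistent with the behaviour described above.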
A abbr.安
A-10IInone.美空军主力近距离空中支援攻击机,无愧为“坦克杀手”。
A-12none.夭折的美海军第一种隐形舰载攻击机。
A-4 none.54年服役的单座轻型舰载攻击机,现仍被多国使用。
A-6none.双座重型全天候舰载攻击机,主要用于低空突防,可进行核打击。
A-7IInone.离开沙场的单座亚音速攻击机,曾是美海空军主力。
A-OKnone.极好, 妙极, 完美的
A-Znone.无所不包的
A-boilern.原子反应器加热用的锅炉
A-bombn.原子弹
A-certificatenone.儿童不宜n.A级
A-controln.原子能管制
A-energyn.原子能
A-framen.金字塔形建筑物
A-lovelnone.英语学校里某一课程结束时举行的高深考试, 高深级考试及格
A-oneadj.第一等的, 第一流的
A-roadnone.A级公路, 主车道
A-siden.A面
A-testn.原子爆炸试验
A-weaponn.原子武器
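Splitting code:
public static void main(String[] args) throws Exception {
    // util.Directory and util.MyFileTool are project-specific helpers, not part of the JDK.
    String dic = util.Directory.GetAppPath("steamData") + "dic.txt.bak";
    BufferedReader br = util.MyFileTool.GetBufferReader(dic);
    String line;
    // readLine() returns null at end of file, which is a more reliable loop condition than ready().
    while ((line = br.readLine()) != null) {
        // "\\uf8f5" is the regex \uf8f5, which matches the single character U+F8F5.
        String[] words = line.split("\\uf8f5");
        System.out.println("size: " + words.length);
        System.out.println(words[0]);
    }
    br.close();
}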
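The code above depends on the author's own file helpers, so as a self-contained sketch the same split can be tried on an in-memory string. The class and method names below are hypothetical, and the sample line is a shortened stand-in for one dictionary entry. The second helper prints each code point, which is one way to find out which invisible character a file really uses as its separator.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Pattern;

public class PrivateUseSplitDemo {
    // Precompiled so the pattern is not rebuilt for every line.
    private static final Pattern SEPARATOR = Pattern.compile("\\uf8f5");

    /** Splits one dictionary line into its fields on U+F8F5. */
    static String[] splitLine(String line) {
        // A limit of -1 keeps trailing empty fields instead of dropping them.
        return SEPARATOR.split(line, -1);
    }

    /** Lists the code points of a line as U+XXXX to reveal invisible characters. */
    static List<String> codePoints(String line) {
        List<String> out = new ArrayList<>();
        line.codePoints().forEach(cp -> out.add(String.format("U+%04X", cp)));
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical sample line: headword, separator, part of speech.
        String[] words = splitLine("A-bomb\uf8f5n.");
        System.out.println("size: " + words.length);
        System.out.println(words[0]);
        // Dump the code points of a short string containing the separator.
        System.out.println(codePoints("A\uf8f5n"));
    }
}
```

If the code point dump shows a different value for the separator, the same approach should work with that value substituted into the pattern.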