java - 如何替换xml文件中字符串的一部分?

标签 java xml parsing dom xpath

我有一个 xml 文件,内容如下:

<Verbiage>
        The whiskers plots are based on the responses of incarcerated 
        <Choice>
            <Juvenile> juveniles who have committed sexual offenses. </Juvenile>
            <Adult> adult sexual offenders. </Adult>
        </Choice> 
        If the respondent is a 
        <Choice>
            <Adult>convicted sexual offender, </Adult>
            <Juvenile>juvenile who has sexually offended, </Juvenile>
        </Choice> 
        #his/her_lc# percentile score, which defines #his/her_lc# position 
        relative to other such offenders, should be taken into account as well as #his/her_lc# T score. Percentile 
        scores in the top decile (> 90 %ile) of such offenders suggest that the respondent 
        may be defensive and #his/her_lc# report should be interpreted with this in mind.
    </Verbiage>

我正在尝试找到一种方法来解析 xml 文件(我一直在使用 DOM),搜索 #his/her_lc# 并将其替换为“her”。我尝试过使用 FileReader、BufferedReader、string.replaceAll、FileWriter,但这些都不起作用。

有没有办法使用 XPath 来做到这一点?

最终我想在这个 xml 文件中搜索该字符串并将其替换为另一个字符串。

我是否必须在字符串周围添加一个标签,我希望它以这种方式解析它?

我尝试过的代码:

protected void parse() throws ElementNotValidException {
    try {
        //Parse xml File
        File inputXML = new File("template.xml");
        DocumentBuilderFactory parser = DocumentBuilderFactory.newInstance(); // new instance of doc builder
        DocumentBuilder dParser = parser.newDocumentBuilder(); // calls it
        Document doc = dParser.parse(inputXML); // parses file

        FileReader reader = new FileReader(inputXML);
        String search = "#his/her_lc#";
        String newString;

        BufferedReader br = new BufferedReader(reader);
        while ((newString = br.readLine()) != null){
            newString.replaceAll(search, "her");
        }

        FileWriter writer = new FileWriter(inputXML);
        writer.write(newString);
        writer.close();

    } catch (ParserConfigurationException e) {
        e.printStackTrace();
    } catch (IOException e) {
        e.printStackTrace();
    } catch (SAXException e) {
        e.printStackTrace();
    }

我需要修复的代码:

try {
        File inputXML = new File("template.xml"); // creates new input file
        DocumentBuilderFactory parser = DocumentBuilderFactory.newInstance(); // new instance of doc builder
        DocumentBuilder dParser = parser.newDocumentBuilder(); // calls it
        Document doc = dParser.parse(inputXML); // parses file
        doc.getDocumentElement().normalize();

        NodeList pList = doc.getElementsByTagName("Verbiage"); // gets element by tag name and places into list to begin parsing

        int gender = 1; // gender has to be taken from the response file, it is hard coded for testing purposes
        System.out.println("----------------------------"); // new line

        // loops through the list of Verbiage tags
        for (int temp = 0; temp < pList.getLength(); temp++) {
            Node pNode = pList.item(0); // sets node to temp

            if (pNode.getNodeType() == Node.ELEMENT_NODE) { // if the node type = the element node
                Element eElement = (Element) pNode;
                NodeList pronounList = doc.getElementsByTagName("pronoun"); // gets a list of pronoun element tags

                if (gender == 0) { // if the gender is male

                    int count1 = 0;
                    while (count1 < pronounList.getLength()) {

                        if ("#he/she_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("he");
                        }

                        if ("#he/she_caps#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("He");
                        }

                        if ("#his/her_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("his");
                        }
                        if ("#his/her_caps#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("His");
                        }

                        if ("#him/her_lc#".equals(pronounList.item(count1).getTextContent())) {
                            pronounList.item(count1).setTextContent("him");
                        }
                        count1++;
                    }
                    pNode.getNextSibling();

                } else if (gender == 1) { // female
                    int count = 0;
                    while (count < pronounList.getLength()) {

                        if ("#he/she_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("she");
                        }

                        if ("#he/she_caps3".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("She");
                        }

                        if ("#his/her_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("her");
                        }
                        if ("#his/her_caps#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("Her");
                        }

                        if ("#him/her_lc#".equals(pronounList.item(count).getTextContent())) {
                            pronounList.item(count).setTextContent("her");
                        }
                        count++;
                    }
                    pNode.getNextSibling();
                }
            }
        }
        // write the content to file
        TransformerFactory transformerFactory = TransformerFactory.newInstance();
        Transformer transformer = transformerFactory.newTransformer();
        DOMSource source = new DOMSource(doc);

        System.out.println("-----------Modified File-----------");
        StreamResult consoleResult = new StreamResult(System.out);
        transformer.transform(source, new StreamResult(new FileOutputStream("template.xml"))); // writes changes to file
    } catch (Exception e) {
        e.printStackTrace();
    }

}

如果我能弄清楚如何将标签代词与该代码所在的代词解析器关联起来,我认为这段代码会起作用。

最佳答案

我使用了这个示例和您的 template.xml,我认为它有效。

public static void main(String[] args) {

        File inputXML = new File("template.xml");
        BufferedReader br = null;
        String newString = "";
        StringBuilder strTotale = new StringBuilder();
        try {

        FileReader reader = new FileReader(inputXML);
        String search = "#his/her_lc#";


        br = new BufferedReader(reader);
        while ((newString = br.readLine()) != null){
            newString = newString.replaceAll(search, "her");
            strTotale.append(newString);
        }

        } catch ( IOException  e) {
            // TODO Auto-generated catch block
            e.printStackTrace();
        } // calls it
        finally
        {
            try {
                br.close();
            } catch (IOException e) {
                // TODO Auto-generated catch block
                e.printStackTrace();
            }
        }


        System.out.println(strTotale.toString());


    }

首先必须重新分配replaceAll的结果:

newString = newString.replaceAll(search, "her");

其次,我使用 StringBuffer 来收集所有行。

我希望这会有所帮助。

关于java - 如何替换xml文件中字符串的一部分?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/39844028/

相关文章:

Java 泛型参数

javascript - 日历上的重复事件 : RFC 5545 Javascript parsing

sql - 从 FOR XML Sql 查询中解码文本

iphone - NSXMLParser 在无效 XML 上崩溃

c# - 从资源中读取 xml 文件

java - ANTLR(field=value),这个怎么表达呢?

java - 如何解析这个语法?

java - 在屏幕关闭时延迟接收器的启动 Activity

java - Morphia 中的复杂查询

java - 无法使用 ScalarDB 连接到我自己的 azure cosmos 数据库实例