java - java POI 按列名读取Excel

标签 java excel apache-poi

下午好专家有一个问题,我需要按列名而不是索引读取整个 Excel 文件,示例:

Column1 | Column2 | Column3
data1      data 2    data 3

POI允许我通过getColumnIndex()方法读取列索引返回Column1 = 0 , Column2= 1 etc,但我需要按列名 Column1 来阅读它, Column2 ETC , 有什么办法可以做到这一点吗??

我需要按列名称读取行和列的所有字段。 附上我读取文件的代码:

更新的代码:

import org.apache.poi.ss.usermodel.Cell;
import org.apache.poi.ss.usermodel.DataFormatter;
import org.apache.poi.ss.usermodel.Row;
import org.apache.poi.xssf.usermodel.XSSFSheet;
import org.apache.poi.xssf.usermodel.XSSFWorkbook;

import java.io.File;
import java.io.FileInputStream;
import java.io.FileNotFoundException;
import java.io.IOException;
import java.util.Iterator;
import java.util.regex.Matcher;
import java.util.regex.Pattern;


public class example {

    DataFormatter fmt = new DataFormatter();

    /**
     * @param args the command line arguments
     */
    public static void main(String[] args) throws FileNotFoundException, IOException {
        example softMarti = new example();
        FileInputStream file = new FileInputStream(new File("C:archive.xlsx"));
        XSSFWorkbook workbook = new XSSFWorkbook(file);
        XSSFSheet sheet = workbook.getSheetAt(0);
        Iterator<Row> rowIterator = sheet.iterator();
        while (rowIterator.hasNext()) {
            Row row = rowIterator.next();
            int rowIndex = row.getRowNum();
            if (rowIndex < 1) {
                continue;
            }
            Iterator<Cell> cellIterator = row.cellIterator();

            while (cellIterator.hasNext()) {
                Cell cell = cellIterator.next();
                int columnIndex = cell.getColumnIndex();
                if (columnIndex != 0 && columnIndex != 1 && columnIndex != 4) {
                    continue;
                }
                String columnName = "";
                switch (columnIndex) {
                    case 0:
                        columnName = "column1";
                        break;
                    case 1:
                        columnName = "column2";
                        break;
                    case 4:
                        columnName = "column 4";
                        break;
                }
                String value = example.getValue(cell);
                boolean valid = example.isValid(columnIndex, value);
                if (valid) {
                    continue;
                }
                System.out.print(columnName + rowIndex);
                System.out.println(" -> " + value);

            }

        }

        // TODO code application logic here
    }

    private String getValue(Cell cell) {
        switch (cell.getCellType()) {
            case Cell.CELL_TYPE_BLANK:
                return null;
            case Cell.CELL_TYPE_BOOLEAN:
                return "CELL_TYPE_BOOLEAN";
            case Cell.CELL_TYPE_ERROR:
                return "CELL_TYPE_ERROR";
            case Cell.CELL_TYPE_FORMULA:
                return "CELL_TYPE_FORMULA";
            case Cell.CELL_TYPE_NUMERIC:
                return fmt.formatCellValue(cell);
            case Cell.CELL_TYPE_STRING:
                return cell.getStringCellValue();
            default:
                return "none";

        }

    }

    boolean isValid(int column, String value) {
        if (value == null) {
            return false;
        }
        String pattern = "";
        switch (column) {
            case 0:
                pattern = "[A-Za-z0-9_\\- ]{1,20}";
                break;
            case 1:
                pattern = "[A-Za-z0-9_\\- ]{1,80}";
                break;
            case 4:
                pattern = "[0-9]{1,8}";
                break;
        }
        Pattern pat = Pattern.compile(pattern);
        Matcher mat = pat.matcher(value);
        return mat.matches();
    }
}

此代码有效,但我需要验证列名称,因为对于我的项目,列可能会改变位置,这是我的目标

最佳答案

为什么不读取第一行(0)单元格值(0-n)(又名列名称)并将(columnName,columnIndex)放入 String/int 的映射中。然后您可以按名称引用列索引。

这是一个例子:

Map<String, Integer> map = new HashMap<String,Integer>(); //Create map
HSSFRow row = sheet.getRow(0); //Get first row
//following is boilerplate from the java doc
short minColIx = row.getFirstCellNum(); //get the first column index for a row
short maxColIx = row.getLastCellNum(); //get the last column index for a row
for(short colIx=minColIx; colIx<maxColIx; colIx++) { //loop from first to last index
   HSSFCell cell = row.getCell(colIx); //get the cell
   map.put(cell.getStringCellValue(),cell.getColumnIndex()) //add the cell contents (name of column) and cell index to the map
 }

在此之后,您将获得来自columnName --->index.html 的 map 。然后你可以这样做:

int idx = map.get("ColumnName");

...您可以在 row.getCell(idx) 中使用它来获取所有其他行中的单元格。

阅读代码中下面的注释。除了这个我帮不了你。您需要阅读文档并弄清楚如何操作。

Workbook workbook = WorkbookFactory.create(new FileInputStream("C:\\file.xlsx"));

Sheet sheet = workbook.getSheetAt(0);
totalRows = sheet.getPhysicalNumberOfRows();

Map<String, Integer> map = new HashMap<String,Integer>(); //Create map
HSSFRow row = sheet.getRow(0); //Get first row
//following is boilerplate from the java doc
short minColIx = row.getFirstCellNum(); //get the first column index for a row
short maxColIx = row.getLastCellNum(); //get the last column index for a row
for(short colIx=minColIx; colIx<maxColIx; colIx++) { //loop from first to last index
HSSFCell cell = row.getCell(colIx); //get the cell
map.put(cell.getStringCellValue(),cell.getColumnIndex()) //add the cell contents (name of column) and cell index to the map
}

List<ReportRow> listOfDataFromReport = new ArrayList<ReportRow>();
for(int x = 1; x<=totalRows; x++){
 ReportRow rr = new ReportRow(); //Data structure to hold the data from the xls file.
 HSSFRow dataRow = sheet.getRow(x); //get row 1 to row n (rows containing data)

 int idxForColumn1 = map.get("Column1"); //get the column index for the column with header name = "Column1"
 int idxForColumn2 = map.get("Column2"); //get the column index for the column with header name = "Column2"
 int idxForColumn3 = map.get("Column3"); //get the column index for the column with header name = "Column3"

 HSSFCell cell1 = dataRow.getCell(idxForColumn1) //Get the cells for each of the indexes
 HSSFCell cell2 = dataRow.getCell(idxForColumn2) 
 HSSFCell cell3 = dataRow.getCell(idxForColumn3)  

 //NOTE THAT YOU HAVE TO KNOW THE DATA TYPES OF THE DATA YOU'RE EXTRACTING.
 //FOR EXAMPLE I DON'T THINK YOU CAN USE cell.getStringCellValue IF YOU'RE TRYING TO GET A NUMBER
 rr.setColumn1(cell1.getStringCellValue()); //Get the values out of those cells and put them into the report row object
 rr.setColumn2(cell2.getStringCellValue());
 rr.setColumn3(cell3.getStringCellValue());

 listOfDataFromReport.add(rr);

}

//Now you have a list of report rows
for(int j = 0; j< listOfDataFromReport.size();j++){
   System.out.println("Column 1 Value: " +   listOfDataFromReport.get(j).getColumn1())
//etc...    
}

//This class holds the values from the xls file.  You may not need it
// I have no idea what you're doing with the data.  If you simply wanted to 
//print the data to console you wouldn't need it.
public static class ReportRow{
private String column1;
private String column2;
private String column3;

public String getColumn1(){
    return this.column1;
}
public void setColumn1(String column1){
    this.column1 = column1;
}   

public String getColumn2(){
    return this.column2;
}
public void setColumn2(String column2){
    this.column2 = column2;
}       
public String getColumn3(){
    return this.column3;
}
public void setColumn3(String column3){
    this.column3 = column3;
}   
}

关于java - java POI 按列名读取Excel,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/35905135/

相关文章:

多条件EXCEL公式

arrays - 编译错误: ByRef arugment type mismatch: How to call a value in an array

java - 避免 Word 在 Apache POI 生成的 .doc 开头显示空白页

java - 正确的 HTTP GET 请求

java - 如何在Java中访问rowSet的返回结果

perl - 使用perl excel win32 ole检索Excel单元格背景颜色

apache-poi - 将 excel 文件读写到 MarkLogic

java - POI - 在 Excel 中打开时无法写入文件?

java - 更新 JPanel/JFrame 不起作用

java - 与 JSF 2.1 一起使用的 JSTL 版本