我有 js 和不同的字符串,如下所示:
Tue Aug 11 2015 between 4:00 PM and 5:00 PM
之间的单词
和and
未更改
但有时我可以得到这个字符串,单词之间有不同数量的空格
Tue Aug 11 2015 between 4:00 PM and 5:00 PM (3 spaces)
或
Tue Aug 11 2015 between 4:00 PM and 5:00 PM (1 spaces)
是否可以为此字符串创建正则表达式?
string re1="((?:Monday|Tuesday|Wednesday|Thursday|Friday|Saturday|Sunday|Tues|Thur|Thurs|Sun|Mon|Tue|Wed|Thu|Fri|Sat))"; // Day Of Week 1
string re2="(\\s+)"; // White Space 1
string re3="((?:Jan(?:uary)?|Feb(?:ruary)?|Mar(?:ch)?|Apr(?:il)?|May|Jun(?:e)?|Jul(?:y)?|Aug(?:ust)?|Sep(?:tember)?|Sept|Oct(?:ober)?|Nov(?:ember)?|Dec(?:ember)?))"; // Month 1
string re4="(\\s+)"; // White Space 2
string re5="((?:(?:[0-2]?\\d{1})|(?:[3][01]{1})))(?![\\d])"; // Day 1
string re6="(\\s+)"; // White Space 3
string re7="((?:(?:[1]{1}\\d{1}\\d{1}\\d{1})|(?:[2]{1}\\d{3})))(?![\\d])"; // Year 1
string re8="(\\s+)"; // White Space 4
string re9="(\"between\")"; // Double Quote String 1
string re10="(\\s+)"; // White Space 5
string re11="((?:(?:[0-1][0-9])|(?:[2][0-3])|(?:[0-9])):(?:[0-5][0-9])(?::[0-5][0-9])?(?:\\s?(?:am|AM|pm|PM))?)"; // HourMinuteSec 1
string re12="(\\s+)"; // White Space 6
string re13="(\"and\")"; // Double Quote String 2
string re14="(\\s+)"; // White Space 7
string re15="((?:(?:[0-1][0-9])|(?:[2][0-3])|(?:[0-9])):(?:[0-5][0-9])(?::[0-5][0-9])?(?:\\s?(?:am|AM|pm|PM))?)"; // HourMinuteSec 2
如何简化这一行的正则表达式?
最佳答案
这是我重新使用您的代码的尝试:
var re1="((?:Monday|Tuesday|Wednesday|Thursday|Friday|Saturday|Sunday|Tues|Thur|Thurs|Sun|Mon|Tue|Wed|Thu|Fri|Sat))"; // Day Of Week 1
var re2="\\s+"; // White Space 1
var re3="((?:Jan(?:uary)?|Feb(?:ruary)?|Mar(?:ch)?|Apr(?:il)?|May|Jun(?:e)?|Jul(?:y)?|Aug(?:ust)?|Sep(?:tember)?|Sept|Oct(?:ober)?|Nov(?:ember)?|Dec(?:ember)?))"; // Month 1
var re5="((?:(?:[0-2]?\\d)|(?:3[01])))(?!\\d)"; // Day 1
var re7="(\\b(?:1\\d{3}|2\\d{3})\\b)"; // Year 1
var re11="((?:[0-1][0-9]|2[0-3]|[0-9]):[0-5][0-9](?::[0-5][0-9])?(?:\\s*(?:am|AM|pm|PM))?)"; // HourMinuteSec
var reDay = "\\b((?:0?\\d|[12]\\d|3[01]))\\b";
var s = "Tue Aug 11 2015 between 4:00 PM and 5:00 PM";
var rx = RegExp(re1 + re2 + re3 + re2 + reDay + re2 + re7 + re2 + "between" + re2 + re11 + re2 + "and" + re2 + re11, 'i');
if ((m = rx.exec(s)) !== null) {
document.write("Day of week: " + m[1] + "<br/>");
document.write("Month: " + m[2] + "<br/>");
document.write("Day: " + m[3] + "<br/>");
document.write("Year: " + m[4] + "<br/>");
document.write("From: " + m[5] + "<br/>");
document.write("Till: " + m[6]);
}
请注意,我没有捕获空格(删除了括号),添加了一个 reDay
几天,仅使用 \b\d{1,2}\捕获两个数字作为整个单词b
,我已经了解了您的一些正则表达式(删除了不必要的括号),并通过将 \s?
更改为 \s*
来修复时间正则表达式。看来这是主要问题,因为 ?
代表出现 0 次或 1 次,而 *
表示出现 0 次或多次 em>.
关于javascript - JS : Regular expression for string,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/31934455/