我正在尝试抓取本地 HTML 文件以获得页面标题我正在使用剧作家 Node Js 库,但我无法将其拉出来,请给我帮助,这里是代码
import playwright from "playwright";
async function main() {
const browser = await playwright.chromium.launch({
headless: true// setting this to true will not run the UI
});
const page = await browser.newPage();
await page.goto('C:\Users\weske\Desktop\WebScraping\index.html');
let title = await page.title();
console.log(title);
await page.waitForTimeout(5000); // wait for 5 seconds
await browser.close();
}
main();
// error messages
(node:15800) UnhandledPromiseRejectionWarning: page.goto: net::ERR_FILE_NOT_FOUND at
c:UsersweskeDesktopWebScrapingindex.html
=========================== logs ===========================
navigating to "c:UsersweskeDesktopWebScrapingindex.html", waiting until "load"
============================================================
at main (file:///C:/Users/weske/Desktop/WebScraping/node.js:11:13)
(Use `node --trace-warnings ...` to show where the warning was created)
(node:15800) UnhandledPromiseRejectionWarning: Unhandled promise rejection. This error
originated either by throwing inside of an
async function without a catch block, or by rejecting a promise which was not handled with
.catch(). To terminate the node process on unhandled promise rejection, use the CLI flag `--
unhandled-rejections=strict` (see
https://nodejs.org/api/cli.html#cli_unhandled_rejections_mode). (rejection id: 1)
(node:15800) [DEP0018] DeprecationWarning: Unhandled promise rejections are deprecated. In
the future, promise rejections that are not handled will terminate the Node.js process with a
non-zero exit code.
最佳答案
始终确保按照这种格式放置文件路径
`file://C:/Users/weske/Desktop/Webscraping/index.html`
关于node.js - 使用 playwright 抓取本地 html 文件,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/70009973/