✍️ 🧑‍🦱 💚 Autor:innen verdienen bei uns doppelt. Dank euch haben sie so schon 367.705 € mehr verdient. → Mehr erfahren 💪 📚 🙏

Learning To Crawl Web Forums

Learning To Crawl Web Forums

von Vipul Punjabi
Softcover - 9786135812343
35,90 €
  • Versandkostenfrei
Auf meine Merkliste
  • Hinweis: Print on Demand. Lieferbar in 5 Tagen.
  • Lieferzeit nach Versand: ca. 1-2 Tage
  • inkl. MwSt. & Versandkosten (innerhalb Deutschlands)

Autorenfreundlich Bücher kaufen?!

Beschreibung

Present Forum Crawler Under Supervision (FoCUS), a supervised web-scale forum crawler. The goal of FoCUS is to crawl relevant forum content from the web with minimal overhead. Forum threads contain information content that is the target of forum crawlers. Although forums have di¿erent layouts or styles and are powered by di¿erent forum software packages, they always have similar implicit navigation paths connected by speci c URL types to lead users from entry pages to thread pages. Based on this observation, we reduce the web forum crawling problem to a URL-type recognition problem. And we show how to learn accurate and e¿ective regular expression patterns of implicit navigation paths from automatically created training sets using aggregated results from weak page type classi ers. Robust page type clas-si ers can be trained from as few as ve annotated forums and applied to a large set of unseen forums.

Details

Verlag LAP LAMBERT Academic Publishing
Ersterscheinung 10. Januar 2018
Maße 22 cm x 15 cm x 0.4 cm
Gewicht 107 Gramm
Format Softcover
ISBN-13 9786135812343
Seiten 60