# WebService-LOC-CongRec **Repository Path**: mirrors_gitpan/WebService-LOC-CongRec ## Basic Information - **Project Name**: WebService-LOC-CongRec - **Description**: Read-only release history for WebService-LOC-CongRec - **Primary Language**: Unknown - **License**: Not specified - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 0 - **Forks**: 0 - **Created**: 2020-10-20 - **Last Updated**: 2025-11-10 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README A framework for crawling pages in the congressional record. By default, this crawls pages starting from the Daily Issues page (http://thomas.loc.gov/home/Browse.php?&n=Issues), visiting each issue in a depth-first fashion. ## Synopsis use WebService::LOC::CongRec::Crawler; use Log::Log4perl; Log::Log4perl->init_once('log4perl.conf'); $crawler = WebService::LOC::CongRec::Crawler->new(); $crawler->goForth(process => \&process_page); sub process_page { my ($day, $page) = @_; my $logger = Log::Log4perl->get_logger('thomas.pl.process_page'); $logger->info("Page #$i ID " . $page->pageID); exit if ++$i > $max; }