| User Agent | | Verified | Date Added |
| panscient.com |  | Yes | 9/11/2006 22:42:55 |
At Panscient Technologies we design, build and operate custom internet search engines that unlock the hidden structure of web data.
Using state of the art AI technology, Panscient Technologies' software analyzes web sites for their information content and compiles the data into a searchable index. Our software can be trained to recognize specific entities and relations, so whatever your application, from searching product reviews to detecting new job ads, Panscient Technologies can supply a custom search engine for the task. |
| TencentTraveler |  | | 9/11/2006 22:30:13 |
| bad url encoding |
| WinPodder |  | | 5/11/2006 22:56:18 |
| podcast |
| Snappy |  | | 5/11/2006 22:54:45 |
|
| charlotte betaspider.com |  | | 5/11/2006 22:51:16 |
We are a stealth-mode startup that is indexing the web for a novel application. We plan to release this new service to the public very soon. We are not attempting to steal any copyrighted information from your site and will not be re-distributing your content. We will only be allowing users to find for your website more easily. |
| DepSpid |  | | 5/11/2006 22:43:42 |
| DepSpid is a distributed kind of a web crawler. The DepSpid spider visits domains, analyses links and finally calculates scores about the link dependencies between individual domains. Each spider job starts at the main page of a domain and then follows each link on that page retrieving more pages and analysing them, too. The spider stays within one domain. If it finds an external link it only checks if the linked domain is reachable but doesn't continue crawling into the external domain. Every unknown domain will be visited from another spider job at a later time. |
| Mozilla/7.0 |  | | 5/11/2006 22:39:17 |
|
| Pingdom |  | Yes | 5/11/2006 22:37:19 |
| Web site monitoring |
| DataCha0s |  | | 5/11/2006 11:54:03 |
| exploit tool that scans for Perl Awstats |
| AR |  | | 5/11/2006 11:48:03 |
| from Korea |
| bsalsa Bad User Agent |  | | 4/11/2006 23:17:21 |
| user agent is not RFC compliant! |
| Java bad url parsing bots |  | | 2/11/2006 23:39:13 |
| bad url parsing |
| ISC Systems iRc Search |  | | 1/11/2006 23:17:41 |
| fell for bad bot trap |
| WebTraq |  | | 25/10/2006 23:49:47 |
| uses fake user agent: Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1) |
| Mail.ru Agent |  | | 22/10/2006 23:54:37 |
| Mail.ru Agent (IM/VoIP) |
| Fake IE No Windows |  | | 18/10/2006 0:26:13 |
| strange referer: c:/Documents and Settings/kde/Desktop/linksQ12003.htm |
| Mozilla(IE Compatible) |  | | 18/10/2006 0:12:22 |
|
| ArbeFavIcons |  | | 17/10/2006 17:55:56 |
|
| Fake Mozilla 5 on Windows NT 4 |  | | 16/10/2006 22:38:39 |
|
| Fake IE compatible ; MSIE |  | | 16/10/2006 15:56:59 |
| fell for bad bot trap + aggressive |
| Gmane.org favicon grabber |  | | 15/10/2006 22:23:20 |
|
| EDI |  | | 15/10/2006 22:21:10 |
|
| Drupal |  | | 15/10/2006 15:28:51 |
|
| HTMLParser |  | | 15/10/2006 15:26:25 |
|