FW: MediaDefender Proposal: Web Crawler

From: Ben Grodsky <grodsky_at_mediadefender.com>
Date: Tue, 10 Apr 2007 23:26:39 -0700

i'm guessing we'll have a call scheduled in 3 more weeks to discuss this proposal. then, we'll get to wait another 3 weeks until they decide. we'll be lucky if they agree to anything by summer....

From: Jeremy Banks [mailto:Jeremy.Banks_at_ifpi.org]
Sent: Tue 10-Apr-07 22:46
To: Ben Grodsky
Cc: Randy Saaf; Jay Mairs; Mumith Ali; Rosemary Nolan
Subject: Re: MediaDefender Proposal: Web Crawler

Hi Ben

Thanks for your note, I have cc'd Rosie on this note so she can arrange a call to discuss.



-----Original Message-----
From: Ben Grodsky <grodsky_at_mediadefender.com>
To: Jeremy Banks; Mumith Ali
CC: Randy Saaf <randy_at_mediadefender.com>; Jay Mairs <jay_at_mediadefender.com>
Sent: Wed Apr 11 04:48:38 2007
Subject: RE: MediaDefender Proposal: Web Crawler

Jeremy or Mo,

We were wondering whether you've had time to consider the below. Please let us know your thoughts.



From: Ben Grodsky
Sent: Wed 21-Mar-07 20:43
To: jeremy.banks_at_ifpi.org; mumith.ali_at_ifpi.org
Cc: Randy Saaf; Jay Mairs
Subject: MediaDefender Proposal: Web Crawler

 Jeremy and Mo,

Per our previous conversation, outlined herein is MediaDefender's proposed method to gather and transmit information to the IFPI about illegal website sources for musical tracks.

Data Collection: MediaDefender will search Google (www.google.com) for known keywords in order to generate a list of websites leading to mp3 files, using both an Automated and Human approach. While MediaDefender will endeavor to develop automated tools, so that a high volume of searching can be accommodated, MediaDefender recognizes the inherent limitations in a fully automated system and will rely heavily on human input.

        * Automated

                        * That list will then be a Focused List of sites ("Focus List") that MediaDefender iterates through at a high rate for additional known keywords.
                        * MediaDefender's system will be able to iterate through a list of over 20,000 key words, provided by IFPI.

                * Human

                        * MediaDefender Data Analysts ("Data Analysts") will also search High Priority keywords ("High Priority") several times daily, taking special note to update the Focus List as new sites generate online chatter/buzz.
                        * Data Analysts will be advanced to user verification systems, or other tests designed to circumvent automated website parsing, to facilitate thorough searching on more advanced websites.

Reporting: MediaDefender will report to Customer via an XML feed to Customer's specifications. Said feed will include the artist, album, source website, time, date and verify the link was accessible at the time crawled.

Please let us know what your thoughts about this proposal are.


Ben Grodsky
Director of Operations
MediaDefender, Inc.
W: 310.956.3355 M: 323.394.6637
AIM: grodskymd
grodsky_at_mediadefender.com <mailto:grodsky_at_mediadefender.com>
Received on Fri Sep 14 2007 - 10:56:01 BST

This archive was generated by hypermail 2.2.0 : Sun Sep 16 2007 - 22:19:47 BST