
CIS 555 FINAL PROJECT

POOGLE: PENN’S FAVORITE SEARCH ENGINE


Sanjay Paul, Levi Cai, Dan Kim, and Federico Nusymowicz
{sanjayp, cail, dki, fnusy}@seas.upenn.edu

Professor Andreas Haeberlen


ahae@seas.upenn.edu

ABSTRACT

Poogle is a search engine that runs on a distributed network of AWS machines. This paper
describes, analyzes and evaluates the Poogle system.

I. INTRODUCTION
A. Project Goals
We built Poogle with certain objectives in mind.
Our primary goals included:
• Efficiently crawling a large corpus
• Responding to several concurrent queries in a
timely fashion
• Providing a clean user interface
• Serving up high-quality search results
• Developing a highly scalable system able to
operate across a large network of peers
B. High-Level Approach
Our system operates in three distinct phases.
Phase 1 consists of a process that crawls the web, and then indexes pages as they become available. The second phase calculates PageRank for all indexed content and readies the system to answer queries efficiently. Finally, a third process instantiates a server that answers user queries by calculating document rankings and returning search results in order of relevance. Figure 1 depicts the high-level approach.

Figure 1: High-Level Approach

Each of the processes relies upon a distributed network of peer nodes. Individual nodes are responsible for both running the modules and for storing portions of the database. We explain our architecture in further detail in Section II.

C. Project Timeline
• 4/18/2012: crawler operational.
• 4/21/2012: indexer operational.
• 4/28/2012: crawler and indexer integrated; combined module operational across variable-sized node networks.
• 4/29/2012: user interface completed.
• 5/3/2012: PageRank operational.
• 5/5/2012: server module functional.
• 5/6/2012: tuned ranking function.
• 5/8/2012: finished integrating components.

D. Division of Labor
We worked together as much as we could and pair-programmed relatively often, which helped ease component integration towards the end of the project. Each team member contributed to most modules. We also assigned individual accountability for certain specific tasks:
Sanjay Paul: crawler functionality and Pastry ring management.
Levi Cai: indexer functionality and Pastry network testing.
Dan Kim: user interface and PageRank module.
Federico Nusymowicz: server module and BerkeleyDB/AMI management.

II. ARCHITECTURE
A. Database
We store our data persistently using BerkeleyDB.
Our implementation distributes BDB data
structures across several FreePastry peer nodes,
where each peer node takes responsibility for a
subset of the data. Our most relevant data
structures include:
• Pages, which contain the raw XML/HTML content downloaded from a given URL, PageRank information, and a list of the page’s outgoing links. Each peer node takes responsibility for a subset of URL hosts.
• HitLists, which mimic the Google data structure going by the same name [1]. Each HitList corresponds to a specific word within a Page. HitLists also count the number of times the term occurs within a document, maintain position information for each word occurrence, and hold additional term-ranking information, such as whether the term was a ‘fancy hit’ (i.e. a title, header, or meta information).
• HitBins, which aggregate all the HitLists for a specific word. Individual nodes in the network take responsibility for storing a subset of word HitBins.
As shown in Figure 1, BDB data structures serve as the main point of interface between our three component processes.
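To make the storage layout concrete, the following is a minimal, illustrative sketch of these three record types in Java. The class and field names (e.g. outgoingLinks, fancyHit) are shorthand for this paper, not the exact identifiers used in our code.

import java.util.List;

// Illustrative sketch of the three record types persisted in BDB.
class Page {
    String url;                  // key; each peer node owns a subset of URL hosts
    String rawContent;           // raw XML/HTML downloaded from the URL
    double pageRank;             // filled in during Phase 2
    List<String> outgoingLinks;  // links extracted by the crawler
}

class HitList {
    String word;                 // the term this HitList describes
    String pageUrl;              // the Page the term occurs in
    int frequency;               // number of occurrences within the document
    List<Integer> positions;     // position of each occurrence
    boolean fancyHit;            // term appeared in a title, header, or meta tag
    double tf;                   // TF factor, computed by the indexer
}

class HitBin {
    String word;                 // one bin per word; stored on the node owning the word's hash
    List<HitList> hitLists;      // all HitLists for the word, later sorted by TF factor
}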
B. Crawler / Indexer
Our crawler’s architecture borrows heavily from Mercator’s design [2], with some simplifications made in the interest of reducing development overhead. Each crawler node operates as follows (a sketch of the loop appears after the list):
1. Poll the local BDB’s URL queue.
2. Enforce politeness; if the URL’s host was recently pinged, move the URL to the back of the queue and repeat step 1.
3. Download the HTML/XML content and store it as a Page in the local BDB.
4. Extract all links and route each one to the node responsible for the URL’s host.
5. Go back to step 1.
Whenever a crawler node receives a link, it checks whether the URL is a duplicate; if so, the link gets discarded. Otherwise the link gets added to the back of the node’s URL queue.
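The following is a simplified, compilable sketch of that loop. The queue, politeness check, downloader, and routing helpers here are illustrative stand-ins for our BDB-backed frontier, per-host timers, HTTP client, and Pastry routing, not the actual class names in our code.

import java.util.*;

// Simplified, illustrative sketch of one crawler node's main loop.
class CrawlerNodeSketch {
    Deque<String> urlQueue = new ArrayDeque<>();       // local URL frontier (BDB-backed in Poogle)
    Set<String> seenUrls = new HashSet<>();            // duplicate filter for received links
    Map<String, String> localPages = new HashMap<>();  // stand-in for the local BDB Page store

    void crawlLoop() {
        while (true) {
            String url = urlQueue.poll();              // 1. poll the local URL queue
            if (url == null) break;
            if (!isPolite(url)) {                      // 2. host contacted too recently?
                urlQueue.addLast(url);                 //    move it to the back and repeat step 1
                continue;
            }
            String html = download(url);               // 3. download the HTML/XML content
            localPages.put(url, html);                 //    and store it as a Page locally
            for (String link : extractLinks(html)) {   // 4. route each extracted link to the
                routeToOwner(link);                    //    node responsible for its host
            }
        }                                              // 5. go back to step 1
    }

    // Owner-side handler: duplicates are discarded, new links join the back of the queue.
    void onLinkReceived(String link) {
        if (seenUrls.add(link)) {
            urlQueue.addLast(link);
        }
    }

    // Stand-in helpers (hypothetical; the real versions use HTTP, robots rules, and Pastry).
    boolean isPolite(String url) { return true; }
    String download(String url) { return ""; }
    List<String> extractLinks(String html) { return Collections.emptyList(); }
    void routeToOwner(String link) { onLinkReceived(link); }
}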
Figure 2: 2-Node Crawler/Indexer

As soon as pages become available, the indexer parses them one by one. Parsing entails forming HitLists for each word in the content body and then calculating the term’s TF factor. Figure 2 depicts a 2-node crawler & indexer process.
We designed our crawler / indexer with fault tolerance in mind. The process regularly checkpoints by syncing with disk, thereby allowing the crawler and indexer to pick up from where they stopped in the event of a crash. The process is also highly scalable and stable – we successfully ran it across 10 nodes and crawled continuously with no problems.
Our Phase 1 architecture’s main drawback regarding scalability was that new nodes could not dynamically join or leave the process without impacting the overall network’s execution. A possible future improvement could involve periodic checks to dynamically redistribute Pages.
Once the process receives its shutdown signal, the crawler stops running and the indexer sorts HitBins by TF factor, in order to later improve our servers’ query response speeds.
C. PageRank
After halting the crawler / indexer process, our system moves on to compute PageRank. The PageRank calculation begins by aggregating Page data at a single master node. This allowed for better ordering of our results, enabling us to return more authoritative, reliable sources. We implemented PageRank using an iterative, pseudo-distributed Hadoop job; the results from crawling were aggregated and fed into Hadoop.
Since calculating a URL’s PageRank relies on the rank of the pages that link into that URL, and since outgoing links in turn contribute to other pages’ ranks, we needed to iterate a large number of times in order to come to an acceptable result. Our map algorithm accepted URLs along with their associated page ranks and outgoing links. Then, in the combining stage, we aggregated an entry’s PageRank by adding the scaled ranks of all incoming URLs. We iteratively used these entries in the reducing stage to calculate page ranks until reaching some degree of convergence.
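The in-memory sketch below illustrates the per-iteration update this job effectively performs. It is a simplification under stated assumptions: the damping factor of 0.85 is the conventional PageRank constant rather than a value reported here, and the link data would really come from the aggregated Page records rather than an in-memory map.

import java.util.*;

// Illustrative, in-memory sketch of one PageRank iteration.
class PageRankSketch {
    static Map<String, Double> iterate(Map<String, List<String>> outLinks,
                                       Map<String, Double> ranks,
                                       double damping) {
        Map<String, Double> next = new HashMap<>();
        // Start every known page at the "random jump" baseline.
        for (String url : outLinks.keySet()) {
            next.put(url, 1.0 - damping);
        }
        // Each page distributes its current rank evenly over its out-links;
        // each target accumulates the scaled ranks of its in-links.
        for (Map.Entry<String, List<String>> e : outLinks.entrySet()) {
            List<String> targets = e.getValue();
            if (targets.isEmpty()) continue;
            double share = damping * ranks.get(e.getKey()) / targets.size();
            for (String t : targets) {
                next.merge(t, share, Double::sum);
            }
        }
        return next;
    }

    // Fixed-iteration driver, mirroring the "set number of iterations" choice described below.
    static Map<String, Double> run(Map<String, List<String>> outLinks, int iterations) {
        Map<String, Double> ranks = new HashMap<>();
        for (String url : outLinks.keySet()) ranks.put(url, 1.0);
        for (int i = 0; i < iterations; i++) {
            ranks = iterate(outLinks, ranks, 0.85);
        }
        return ranks;
    }
}

The map, combine, and reduce stages described above distribute this same accumulation over the crawled link data.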
Defining the convergence function proved to be a fairly non-trivial task. The iterative reduce step took a long time to run (hours), and as an additional challenge, determining the proper threshold for ‘convergence’ proved to be more of an art than a science. In the interest of time, we chose a less sophisticated approach: a fixed number of iterations. We found that although a fixed iteration count proved less accurate than a rigid ‘diff’-based definition of convergence, with a sufficient number of iterations the end values varied little enough to consider them fairly accurate PageRanks. As an added benefit, the fixed iteration count helped us predict the process’s runtime.
Once the PageRank algorithm completed, we distributed PageRank scores across all network nodes and appended them to HitLists in order to later improve server response time.
D. Server
After all HitLists were updated with their relevant PageRank scores, we switched each node to server mode. Servers then began actively answering queries. More specifically, servers:
1. Listened for query requests.
2. Split queries by term and requested the corresponding HitBins from other servers.
3. Waited for HitBins to return and cached the HitBins for the most popular terms. Servers only retrieved the top 10,000 entries (based on each entry’s TF factor) from a given HitBin in order to improve retrieval speed.
4. Calculated document rankings based on an augmented TF-IDF vector model.
5. Returned the query results.
We implemented a Tomcat servlet in order to route queries to our servers. The queries themselves were then hashed and routed through the Pastry ring, thus balancing computing load across all server nodes and improving mean response time.
Routing queries through Pastry provided the unexpected side effect of fault tolerance: even if a server node crashed, the search engine still remained operational.
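A condensed sketch of this per-query flow follows. The cache, the Pastry lookup, and the ranking call are hypothetical stand-ins (fetchTopEntries, rankByTfIdfAndPageRank) rather than the actual method names in our server code.

import java.util.*;

// Condensed, illustrative sketch of how a server node handles one query.
class QueryServerSketch {
    Map<String, HitBin> hitBinCache = new HashMap<>(); // HitBin cache (eviction policy omitted)

    List<Result> answer(String query) {
        List<HitBin> bins = new ArrayList<>();
        for (String term : query.toLowerCase().split("\\s+")) {   // 2. split the query by term
            if (term.isEmpty()) continue;
            HitBin bin = hitBinCache.get(term);
            if (bin == null) {
                bin = fetchTopEntries(term, 10_000);              // 3. request the bin from the
                hitBinCache.put(term, bin);                       //    owning server, top entries only
            }
            bins.add(bin);
        }
        return rankByTfIdfAndPageRank(query, bins);               // 4-5. rank and return results
    }

    // Stand-ins: route the term's hash through the Pastry ring to retrieve its bin,
    // then apply the augmented TF-IDF vector model described in the next subsection.
    HitBin fetchTopEntries(String term, int limit) { return new HitBin(); }
    List<Result> rankByTfIdfAndPageRank(String query, List<HitBin> bins) { return Collections.emptyList(); }

    static class HitBin { }
    static class Result { }
}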
E. Ranking
Every returned HitBin contains HitLists sorted by TF factor, which the indexer computes as:
TF = freq(word, doc) / max[freq(any word, doc)]
…where word counts are double-weighted for each of the ‘fancy hits’ described earlier. Servers also gather the HitBin’s total size (n) when they retrieve the bin’s top 10,000 entries. Additionally, servers communicate with each other at startup in order to calculate the total size of the corpus (N). To determine a HitList’s TF-IDF weight, servers calculate:
TF-IDF = TF * log(N/n)
Servers then weigh each query term according to the formula:
wquery(word) = 0.5 + [0.5 * freq(word, query) / max(freq(any word, query))] * log(N/n)
For each Page referred to in one of the retrieved HitLists, servers then calculate the cosine similarity between the query and the page, and scale it by the Page’s PageRank to obtain the final ranking score.
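As a worked illustration, the sketch below scores a single candidate page against a query using the formulas exactly as given above. The input maps (per-term TF factors for the page, per-term query counts, per-term HitBin sizes) are simplified stand-ins for what our servers actually read from the retrieved HitLists.

import java.util.*;

// Illustrative scoring sketch following the formulas above.
class RankingSketch {
    static double score(Map<String, Double> docTf, Map<String, Integer> queryFreq,
                        Map<String, Integer> n, long N, double pageRank) {
        int maxQueryFreq = Collections.max(queryFreq.values());
        double dot = 0, docNorm = 0, queryNorm = 0;
        for (String term : queryFreq.keySet()) {
            double idf = Math.log((double) N / Math.max(1, n.getOrDefault(term, 1)));
            double docWeight = docTf.getOrDefault(term, 0.0) * idf;        // TF-IDF = TF * log(N/n)
            double queryWeight = 0.5                                        // wquery(word) as given above
                    + (0.5 * queryFreq.get(term) / maxQueryFreq) * idf;
            dot += docWeight * queryWeight;
            docNorm += docWeight * docWeight;
            queryNorm += queryWeight * queryWeight;
        }
        if (dot == 0) return 0;
        double cosine = dot / (Math.sqrt(docNorm) * Math.sqrt(queryNorm));  // cosine similarity
        return cosine * pageRank;                                           // scale by PageRank
    }
}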
III. EVALUATION
A. Crawler / Indexer
The crawler and indexer were run as a coupled pair in a single JVM process on each of ten Amazon Machine Instances on the EC2 cloud. Each process was allocated a virtual memory size bounded between 1 and 2 MB (min/max). Overflow data was pushed to the key-value store provided by the open-source Berkeley DB distribution, and top-level caching (as well as the caching provided by Berkeley DB) was exploited to improve performance. Resource allocation was distributed across the many performance-throttling resource sinks present in each process, primarily the set of duplicate encountered URLs (cached), the URL expansion frontier (breadth-first), and the space occupied by indexer entries en route between machine nodes and from main memory to disk storage. Each component was allocated an explicitly bounded capacity that varied by priority and relative speed.
The internal construction of the crawler-indexer reflected its outward performance characteristics: the crawler thread pool was relatively small (nine threads was found to be an optimal value) and the indexer thread pool much larger (thirty threads was used, though less testing was done to pinpoint an optimum). This follows naturally from the fact that much of the crawler’s operation is network-I/O bound due to intra-node URL passing and page downloading, with some unavoidable disk overhead due to a non-negligible cache miss rate (often >20%, but still tolerable due to locality). On analysis, the average crawl spent less than 10% of its time performing computations – the rest was blocked on I/O. By contrast, the indexer had much more opportunity to exploit parallelization due to its inherently compute-bound primary operations. One caveat was that indexed buckets needed to be shuffled between machine nodes, resulting in large messages being routed directly to nodes (based on hash).
Aside from using an SLRU cache replacement policy on important top-level objects, we achieved high performance by adopting a “route-to-final” policy: any object (e.g. an indexer entry or URL) that was to be pushed from a source to a sink within the system was buffered and hastily evicted to its final destination. The objective was to clear space in memory by making some bandwidth concessions, and to minimize time wasted on objects pending a push to their endpoint machine instance. This was an exceedingly effective practice, though it led, not surprisingly, to high-frequency message passing and overflow within the system. The FreePastry distributed hashing scheme used to coordinate the machines proved less tolerant of high network traffic than we anticipated, and robustness was strained by message queue overflows and locking exceptions due to timeouts, so we had to implement buffering, throttling, and node-rotation schemes. While buffering (with direct message passing) and throttling were fairly intuitive countermeasures, we also chose to cycle the messaging allowance by triggering nodes to flush their buffers to the system in sequence.
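As an illustration of the route-to-final idea, the sketch below buffers outbound objects per destination node and evicts each small batch to its final destination as soon as it forms. The class, the threshold, and the send helper are hypothetical; they stand in for the direct Pastry message routing described above.

import java.util.*;

// Rough sketch of the "route-to-final" policy: objects bound for another node are
// buffered per destination and flushed to that final destination eagerly.
class RouteToFinalSketch<T> {
    private final Map<String, List<T>> buffers = new HashMap<>(); // destination node -> pending objects
    private final int flushThreshold;

    RouteToFinalSketch(int flushThreshold) { this.flushThreshold = flushThreshold; }

    void push(String destinationNode, T object) {
        List<T> buf = buffers.computeIfAbsent(destinationNode, k -> new ArrayList<>());
        buf.add(object);
        if (buf.size() >= flushThreshold) {       // hastily evict: flush as soon as a small batch forms
            sendBatch(destinationNode, buf);
            buf.clear();
        }
    }

    // Stand-in for routing one large message directly to the owning node.
    void sendBatch(String destinationNode, List<T> batch) { /* network send */ }
}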
The crawler and indexer performed well above expectation, with an average throughput of 160 pages crawled and 117 pages indexed per minute. In a crawling run of approximately 100 minutes, we achieved an indexed corpus of roughly 128K pages. The story, however, was markedly different on an individual machine-to-machine basis. In short, many machines went massively under-utilized due to discrepancies caused by inconsistent hashing: whereas the machine nodes attempt to enforce a uniform hash, the distribution of hashed content (in this case, URLs and keywords) showed a perceptible skew. Future use of the crawler-indexer might improve this behavior by virtualizing nodes (allowing dynamic reassignment of hashes) and/or leveraging machine learning techniques to improve the system’s anticipated distribution of hashed content.

B. Server
Our system was designed to return results quickly, with much of the work done while pre-processing results. Each page had, on average, about 80 B of associated data; after the nearly 1.5-hour crawl and indexing session, the fastest server had nearly 2.2 GB of data stored. Retrievals for single-word queries were near-instantaneous regardless of whether the word was cached, and multi-word searches took only slightly longer.
The query “This is the coolest place on earth” returned in approximately 2.7 seconds, and when issued again it returned in 1.2 seconds on average thanks to the cache. Several other queries of 7+ words returned in similar time intervals, each with several hundred results. Single-word queries were much faster.

IV. LESSONS LEARNED
Building a search engine was an incredibly rich experience – there were countless opportunities for optimizations, tweaks, and improvements. Some of our ideas proved so interesting that we often had trouble focusing on completing the project’s most basic features. We focused so much on optimizing our crawler, for example, that by the end of the project we barely had time to refine our PageRank algorithm. If we had the chance to start over, we would definitely get all the basic features working at a basic level before starting to polish any part of our code base.
We also learned about the importance of clear interfaces. After our first few nights of programming together, we all thought we had a relatively thorough understanding of each component’s architecture and required data structures. After splitting the work, however, we quickly realized how wrong we were.
Lack of clearly specified interfaces ended up costing us countless hours of integration effort.

V. CONCLUSION
In summary, we unanimously concur that architecting this system was by far the most challenging task any of us has ever undertaken, but also among the most rewarding. We gained an appreciation for the wide breadth of challenges inherent to massively scaled system design and the solutions they necessitated. On a fundamental level, a search engine handily incorporates almost every relevant facet of distributed system design, ranging from parallelization considerations to robustness, scalability, and high-performance operation. The lessons we take away from the experience will undoubtedly stay with us well into the future.

REFERENCES
[1] S. Brin and L. Page. “The Anatomy of a Large-Scale Hypertextual Search Engine”. Computer Science Department, Stanford University, 1998.
[2] M. Najork and A. Heydon. “High-Performance Web Crawling”. Compaq SRC, Kluwer Academic Publishers, September 2001.
