Design an API that will suppor

Google Interview Question for Software Engineer / Developers

0

of 0 votes

13
Answers
Design an API that will support constant time add, remove, search and random find operations. Random find will get a random number and return that element. Note: Only hash map will not be sufficient since it cannot support random read.
- hsantosh71 June 15, 2013 in United States | Report Duplicate | Flag | PURGE
Google Software Engineer / Developer Data Structures

Email me when people comment.

An error occurred in subscribing you.

Country: United States
Interview Type: In-Person

More Questions from This Interview

Email me when people comment.

An error occurred in subscribing you.

Comment hidden because of low score. Click to expand.

of 12 vote

A hash table + an array that contains all the keys. The order of the keys stored in the array is not important

add
- we insert the key of the element at the end of the array (push_back).
- the element is then added in the hash table.
- we also keep in the hash table the position where the key was stored in the array

remove
- from the hash table we determined the position where the key of the element is stored in the array. We need then to removed it (this will create a hole). We move then the last key from the array to the position of the removed key. Of course, we need to update the position which was stored in the hash table (for this last key).
- then we do the "normal" removal from the hash;

getrandom
- since all keys are stored in the array, it's really easy to determine a random key ( array[rand() % array.size()]. we then return the random element from the hash table

- Andrei June 15, 2013 | Flag Reply

Comment hidden because of low score. Click to expand.

of 2 votes

Andrei, if you use an array, the insertion are not always O(1). I suppose that you are using a dynamical array (because of the push_back). Every time you run out of space, you have to increase its size and copy every element to a new array with a bigger size.

Since I read this problem, I had an idea very similar to yours on how to implement this DS. But instead of using a Hash Table + Array, used 2 Hash Tables. I have posted the implementation below. Check it out.

- arturo June 16, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 2 votes

@arturo You have a good point +1. I think Andrei is using a STL vector, because he mentioned push_back(). Insertion or removal of elements of std::vector are linear in distance to the end of the vector, which is O(n), if you're not inserting/removing from/to the end of the vector, which is what you're doing in your remove function (see en.cppreference.com/w/cpp/container/vector for complexity of operations).

I think a better choice would be a simple array with dynamic resizing and shrinking. You can use a load factor (3/4) to determine when to grow or resize the array and the complexity will be _amortized_ O(1).

- oOZz June 17, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@oOZz Even if you use an array with dynamic resizing and shrinking, and control the load factor, the total work of managing the array is O(n), (unless you know in advance the maximum number of elements you're going to have) because every time you have to resize the array (in some insertions or deletions), you'll have an operation of O(n) complexity of moving elements from one array to another.

I have posted below a solution with expected constant complexity O(1) in every of the mentioned 4 operations, using 2 Hash Tables. Check it out and let me know what you think about it.

- arturo June 17, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 3 votes

@arturo, @Andrei's solution is fine, it just needs a little tweak on the delete operation. The inserts are to the end of the vector, which is O(1). However for deletes, instead of deleting from the ith position (which is O(n)), he needs to copy the last element over to the ith position and delete the last element, because deleting from the end from a vector is O(1).

Please note, that these operations are "amortized O(1)", which means when you consider entire sequence of operations of the program, in the worst case growing and shrinking the array can cannot occur again and again for a long time, thus "amortizing" its cost. (see en.wikipedia.org/wiki/Amortized_analysis)

- oOZz June 17, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@oOZz You're right. I didn't consider the Amortized analysis. In that case, the best solution, as you said, is using a Hash Table and a dynamic resizing array. Thanks for the information.

- arturo June 20, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 0 votes

@arturo, @oOZz isn't deleting the node will require the node to looked up in array which is unsorted hence O(n) so delete isn't O(1) ??

- ashwini verma June 25, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 1 vote

@ashwini, The look up will take O(1) time because we have stored in the hash table the position of each key in the array. So the hash table will have the following key and values :
Key(string)->(Value(int), Index in array(int))

- arturo June 25, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 3 vote

This is an implementation using two Hash Tables that has an approximate complexity of O(1) for every operation (Add, Remove, Search and Random) if the Hash Tables are well implemented (good hash function and load factor).

The Hash Tables are:
map1: Key(string)->(Value(int), Index of map2 (int))
map2: Key(int)-> Value(string)

The Key of map2 represents an index (from 0 to size of data-1) and its value maps with Key of Map1.

Below you can see the C++ implementation (I used STL <unordered_map> for Hash Table), and assumed the Key is a "string" and the value is an "int".

#include <unordered_map>
#include <string>
#include <array>

using namespace std;
class MyStruct
{
public:
	
	MyStruct() {size_of_data_=0;}

	virtual ~MyStruct(){}

	void Add(string str, int v){
		array<int,2> arr;
		arr[0] = v;
		arr[1]= size_of_data_;
		map1_[str] = arr;
		map2_[size_of_data_++]=str;
	}

	int Search(string str)
	{
		unordered_map<string,array<int,2> >::iterator it =
                map1_.find(str);

		if(map1_.find(str)!=map1_.end()) //Element found
			return it->second[0];
		else //Element not found
			return -1;
	}

	void Remove(string str)
	{
		unordered_map<string,array<int,2> >::iterator it = 
                 map1_.find(str);

		//If element to remove doesn't exist in our Data
		if(it==map1_.end()) 
			return;

		//Get the index element in the map2
		int map2_idx = it->second[1];
	
		//Get key (string) of last element in the map2
		string last_str = map2_[size_of_data_-1];

		//Replace with element to remove with the last element
		map2_[map2_idx] = last_str;

		//Update the map2_idx of last key of map2
		map1_[last_str][1] = map2_idx;

		//Erase element from map1
		map1_.erase(str);

		//Erase last element from map2
		map2_.erase(size_of_data_-1);

		//Decrement size_of_data
		size_of_data_--;
	}

	int Random()
	{
		//Generate big rand number - use your favorite randgen
		unsigned int rand_number = GetRandNumber();

		//If you want to return the value of the random element
		return map1_[map2_[rand_number%size_of_data_]][0];

		//If you want to return the key of the random element
		//uncomment next line
		//return map2_[rand_number%size_of_data];
	}

private:
	unordered_map<string, array<int,2> > map1_;
	unordered_map<int,string> map2_;
	int size_of_data_;
};

- arturo June 16, 2013 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

Isn't random meant to be random number access...say give me the 2nd number in the sequence. In which case your example doesn't work. Please correct me if I am wrong.

- IntwPrep August 13, 2013 | Flag

Comment hidden because of low score. Click to expand.

of 0 vote

add remove search what? in a tree, array of Structure, Linked list...?? please make it clear..:-)

- Jitendra June 15, 2013 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 vote

How about LRU cache ?
I think LRU cache using hashtable and Doubly linked list has O(1) insert, delete, search and random search too.

- king June 16, 2013 | Flag Reply

Comment hidden because of low score. Click to expand.

of 0 votes

does the random search become a problem with LRU cache approach? From Double Linked List, how to do random search with O(1)?

- Anonymous August 01, 2013 | Flag

CareerCup

Google Interview Question for Software Engineer / Developers

Books

Videos

Resume Review

Mock Interviews