AlamySearch in detail
This article describes in depth how Alamy determines the order in which to display images in search results, planned future developments and gives some general guidance on how contributors can get the most out of their images on the Alamy site. Please note that all the systems discussed below are undergoing a process of continuous improvement. These notes may be updated from time to time to reflect any changes.
For the purposes of AlamySearch the following definitions apply:
- AlamyRank
- The score assigned to a collection (pseudonym) based on the number of times images from a set of images have been clicked or purchased in proportion to the number of times they have been viewed by our customers.

- AlamySearch
- The system composed of AlamyRank + Search Engine + Diversity Algorithm.
- Click
- The click of a thumbnail by a customer to zoom the image.
- Collection
- Any group of images assigned the same pseudonym.
- Combined score
- The score that is calculated from AlamyRank + Search Engine + Diversity Algorithm.
- Customer
- Any registered individual client who has spent over a certain threshold with Alamy.
- Diversity Algorithm
- The mechanism to ensure that no single pseudonym dominates the results. It disperses the images evenly among suppliers of similar AlamyRank and relevancy.
- Pseudonym
- The name used to group a collection of images. A contributor can have one or more pseudonyms associated with their account. This is to allow for those contributors who represent multiple photographers, or for those who wish to segment their collections to reflect one or more areas of specialisation in their work.
- Sale
- Any image purchase by a customer who has spent over a certain threshold with Alamy.
- Search engine
- The system that identifies which images on Alamy contain the words searched for by a client. These words may be in any of the caption, keyword, location fields, or pseudonym/agency name.
- View
- The thumbnail of an image being presented in search results to a customer (on a page that is viewed by that customer).
- CTR
- Click through rate. The figure that indicates a pseudonym’s rate of clicks (zooms) per view.
Background
Alamy’s policy of not editing images, charging a low commission, and leaving the keywording to its contributors has proved popular with customers and contributors.
Alamy has always faced the challenge of delivering the right images for a search. Since launch, the sort order had a built-in bias that favoured agencies over photographers. This was the result of a search order logic which gave an approximately fair share of views between contributors. The ‘fairness’ of this system was eclipsed over time by a growing number of images submitted by individual photographers who were given an artificially low share of customer views. The result was a lottery where it became a matter of chance which images were presented to a client.
Alamy appreciated the limitations of this system and began investing heavily in a new approach to create an environment where all contributors could compete on an equal footing.
The first version of the new search system AlamySearch, was launched on 10th October 2006. A key component is AlamyRank, a landmark in the industry in terms of fairness of presentation of images to clients. Where other image libraries may order images using solely one parameter, such as reference number or date uploaded, Alamy’s new system relies exclusively on client activity to objectively place the most popular image suppliers higher up the sort order.
What is AlamySearch?
AlamySearch is a new system composed of three elements: AlamyRank, the Search Engine and the Diversity Algorithm. These elements combined determine the positioning of a collection of images within a set of search results and create a "Combined Score" for a collection.
AlamyRank
AlamyRank assigns a score to a pseudonym based on the number of times images from that collection have been clicked or purchased in proportion to the number of times they have been viewed by customers. The result is an AlamyRank per collection (pseudonym).
The simplified formula for this relationship is:

The Search Engine
The search engine finds images that match the search terms entered by the customer. If a customer searches for ‘dog’ then all the images containing the word ‘dog’ will be returned. Currently, these words may be in any of the caption, keyword, location fields, or in the pseudonym/agency name. In order to return more relevant results the search engine returns exact word matches only, i.e it does not "stem" words., so a search for "Dog" will not return the same result as a search for "Dogs".
In 2007, Alamy launched the first version of its new search engine that will allow better control over how the system behaves, particularly in terms of relevancy. The previous search engine on Alamy gave equal weighting to caption, keywords, location, descriptions and pseudonyms, which was not ideal. The improvements made with the new system are as follows:
The search engine no longer searches the description field meaning the keyword fields are now searched with priority, giving more accurate results.
A tiered annotation system. There are now three keyword fields carrying different weights – Essential (50 characters), Main (300 characters) Comprehensive (856). The search engine views them in descending significance with the essential keyword field having the greatest relevance. Within the search results, although all fields are searched, only a selection of the comprehensive keywords are displayed and we do not display the essential and main keyword fields, this is to prevent keyword theft.
A reduction in significance of the pseudonym field relative to the caption and keyword fields. This helps ensure that, for example, when a client does a search for "fish", a contributor called David Fish does not have his images returned before images of fish (which contain the word fish in their caption or keyword fields).
Proximity (not yet implemented) - words in sequence that match phrases entered by customers will carry more weight than words not in sequence For example, if a client searches for "Blue Whale" an image with the words "Blue" and "Whale" next to each other in the caption or keywords will have a greater relevancy score than an image containing the words "Blue, Sea, Humpback Whale" in this order.
The Diversity Algorithm
The Diversity Algorithm comes into play once AlamyRank and the search engine have established which images have the highest score. Its purpose is to ensure that the images returned are inter-dispersed amongst images of similar rank. This means that search results are not dominated by the collections with the highest combined score and that no single pseudonym dominates the results when others are available. The strength of the Diversity Algorithm diminishes if the gap between the collection rankings is significant. This means that clusters of images from a single collection can appear close together in the search results if there are no competing images available of a similar AlamyRank and/or relevancy score. Given the large number of images on Alamy it is unusual to see this effect on the first few pages of results for most searches.How does AlamyRank work?
AlamyRank is driven exclusively by customer activity. There is no manual intervention in the system. Various measures are used to calculate the AlamyRank for a pseudonym.
"What is measured?"
The decision on what types of client activity to measure in this current version of AlamyRank is based on a detailed analysis of the available data alongside Alamy’s understanding of the purchasing patterns and activity of customers on the site.
In this current version of AlamyRank only sales, clicks and views are used to calculate the AlamyRank for a collection. Clicks were chosen as a strong correlation exists between a click (zoom) and a purchase. In the initial nine month measurement period there was, on average, a purchase for every 20 clicks. Therefore a pseudonym’s CTR (click through rate) score is significant in the calculation of its AlamyRank. The correlation between both shopping carts and lightboxes in relation to a sale was, however, not statistically significant so this type of activity is not used in calculating AlamyRank in the current version.
How does AlamyRank determine a rank for a pseudonym?
The current version of AlamyRank produces a rank per pseudonym and not per image. If a contributor's images are viewed frequently but not clicked or purchased very often it will result in that pseudonym having a low CTR. AlamyRank interprets this to mean that the contributor's images are less popular than a contributor whose images are clicked or sold more often in proportion to how many times they have been viewed. A lower AlamyRank would therefore be assigned to this pseudonym. The ultimate aim is to develop a search system that produces a rank per image.
When AlamyRank went live the ranks for all collections had been calculated on nine months of customer activity. We now update the rankings approximately every 100 days using the previous 300 days data.
All collections that have recorded a sufficient number of views to be statistically significant have had their true AlamyRank applied. All collections that have insufficient views or insufficient sales in the 300 day period have the median AlamyRank applied. Applying the median score to new collections should give them enough visibility from which to calculate a true AlamyRank. Some ranks will go up, some will stay the same, and others will go down.
AlamyRank ignores whether a collection of images is from a single photographer, a specialist agency, an RF agency, a national collection or a small cooperative of photographers. It is only concerned with how many times images within a collection have been viewed, clicked, and purchased.
How does AlamyRank affect contributors?
Alamy’s hypothesis is that, whilst the order in which images are displayed is important, the absolute position of an image is not particularly significant in determining which image will be purchased as picture buyers they generally look through many pages of images before purchasing.
AlamyRank rewards contributors who consistently keyword their images accurately and, over time, will penalise those who don’t. It also penalises contributors who submit too many similar images. Whilst the system has no way of identifying “similars”, supplying many similar images results in a contributor’s images being viewed more frequently by customers. A lot of views can only be good if there are enough clicks or purchases to be competitive with other contributors with a similar number of views.
AlamyRank has no way of determining which image in a particular collection is the best. For example, it does not know which is a contributor’s best image of a dog. All images under one pseudonym have the same AlamyRank assigned. When a client performs a search, the search engine determines the relevancy based upon where the search term features in the various annotation fields. This, combined with the AlamyRank for the image's pseudonym gives a combined score which determines where the image appears in the search results. If there is more than one image with the same combined score and the and same pseudonym, it will return these in a random order. Finally, the Diversity Algorithm disperses images of a similar rank in order that no one supplier dominates the results.
What can contributors do to improve their AlamyRank?
Alamy has released Version 1 of AlamyRank and will be releasing new versions in time. However, there are things contributors can do to make the most of the new search system.
Keywording
Alamy advises that contributors check their keywords carefully to ensure that they are both spelt correctly and are relevant to the image concerned. This will work towards gaining a high CTR score. Keywording both literally and conceptually is still advisable, but contributors should be careful that their captions and keywords are entirely relevant to the image. It is useful for contributors to put themselves in the buyer’s position to see if they would be interested in an image for a particular search term. As a simple example, if keywording an image of a cat it would be best not to add a caption saying ‘Cat which is afraid of dogs’. Each time this image is returned on a search for ‘dogs’ it will be viewed by a client and, in the majority of cases, not clicked or purchased resulting in a low CTR. This will negatively affect the AlamyRank for the pseudonym that is associated with this image. Finally, if the plural and singular applies to your image, then contributors should add both. You can use AlamyMeasures to give you valuable insights into keyword, CTR and search data.
Editing
Alamy has always stressed the importance of editing images before submitting them. For a series of images, it is best to submit a portrait, a landscape and an obscure/conceptual angle from each shoot/subject area to reduce image redundancy and improve the visual quality of results when a client performs a search. Supplying too many similar images will be off-putting to clients and may result in a lower AlamyRank as described in How does AlamyRank affect contributors?
Using pseudonyms to group images
Pseudonyms allow for those contributors who represent multiple photographers or for those who wish to segment their collections to reflect one or more areas of specialisation in their work. As an AlamyRank is assigned to a pseudonym, the rank is the result of client activity across a collection of images. An individual photographer may therefore have had more than one AlamyRank assigned if they have more than one pseudonym.
If all images are grouped into one pseudonym this could, in some circumstances, lower the overall rank. If a contributor is an excellent wildlife photographer but their travel imagery is not as strong as the competition, it may be advisable to split their collection into two pseudonyms. The reason for this is that the comparatively low number of sales and clicks in proportion to views for the travel images may bring down the overall AlamyRank. If a photographer were to split his/her collection into different pseudonyms, they can use AlamyMeasures to track the performance level of each one.
Future plans
The initial releases of AlamyRank have already led to a huge improvement on Alamy’s old searching system. The only way to objectively measure the impact of AlamyRank is to build a realistic picture of what is happening in terms of image placement and sales over time. The rankings are updated approximately every 100 days using the previous 300 days data.
To ensure that customers and contributors benefit fully from the new system, feedback is provided to contributors to help them improve the rank of collections via AlamyMeasures. The goal is to allow contributors to make informed decisions about how to submit and describe their pictures.
Concluding remarks
Alamy has achieved its first objective of building a large volume of imagery from a wide variety of sources. There is now a level playing field so that contributors can compete effectively in the marketplace.
The second objective is to create a more efficient and dynamic marketplace for commercial photographs, where contributors can maximise their sales potential on Alamy. To achieve this Alamy will be developing tools to allow images to be annotated more easily in response to client activity.
![Alamy Images [home]](/images/alamy.png)