Author Archives: Anurag

Query Suggestions for Detailed Queries


Last year we added query suggestions to help students explore topics they may not be familiar with. These suggestions go from a broad search to deeper and more specific areas. But many of you are already well familiar with your research area, and your searches are already specific and detailed. Sometimes, it's good to take a step back and go into a different, but related, space.

Today, we're adding query suggestions for detailed queries. They help researchers explore topics related to the original query. For example, consider the query suggestions for [semantic segmentation object detection]. They cover:
Semantic segmentation: [semantic segmentation rgb d images], [fully convolutional networks for semantic segmentation], [deep structured models for semantic segmentation], [indoor semantic segmentation], [fast semantic segmentation], [semantic segmentation scene classification], [semantic segmentation deconvolution network]
Object detection: [localization accuracy object detection], [joint object detection], [real time object detection]
Combination of concepts: [rich features object detection and segmentation], [semantic segmentation context for object detection]

Note that query suggestions appear below search results.
The new query suggestions span all broad areas of research. For example, see [prions protein folding], [global stock market portfolio selection], [test salmonella spp], [racial discrimination and gerrymandering], [gamma irradiation diamond detector], [binary planet formation], [aspect based sentiment analysis], [axial flow turbojet engine].

For now, the additional suggestions are limited to English queries. We plan to expand the coverage to more languages.


Posted by: Namit Shetty, Software Engineer

2017 Scholar Metrics Released


Scholar Metrics provide an easy way for authors to quickly gauge the visibility and influence of recent articles in scholarly publications. Today, we are releasing the 2017 version of Scholar Metrics. This release covers articles published in 2012–2016 and includes citations from all articles that were indexed in Google Scholar as of June 2017.

Scholar Metrics include journal articles from websites that follow our inclusion guidelines, selected conference articles in Computer Science & Electrical Engineering and preprints from arXiv and NBER. Publications with fewer than 100 articles in 2012-2016, or publications that received no citations over these years are not included.

You can browse publications in specific categories such as Ceramic Engineering, High Energy & Nuclear Physics, or Film as well as broad areas like Engineering & Computer Science or Humanities, Literature & Arts . You will see the top 20 publications ordered by their five-year h-index and h-median metrics. You also can browse the top 100 publications in several languages - for example, Portuguese and Spanish. For each publication, you can view the top papers by clicking on the h5-index.

Scholar Metrics include a large number of publications beyond those listed on the per-category and per-language pages. You can find these by typing words from the title in the search box, e.g., [allergy], [cardiología], [biomarkers].

For more details, see the Scholar Metrics help page.

Posted by: Anurag Acharya, Distinguished Engineer

Classic Papers: Articles That Have Stood The Test of Time

Scholarly research is often about the latest findings - the newest knowledge that our colleagues have gleaned from nature. Some articles buck this pattern and have impact long after their publication.

Today, we are releasing Classic Papers, a collection of highly-cited papers in their area of research that have stood the test of time. For each area, we list the ten most-cited articles that were published ten years earlier.

This release of classic papers consists of articles that were published in 2006 and is based on our index as it was in May 2017. To browse classic papers, select one of the broad areas and then select the specific research field of your interest. For example, Agronomy & Crop Science, Oil, Petroleum & Natural Gas, and African Studies & History.

The list of classic papers includes articles that presented new research. It specifically excludes review articles, introductory articles, editorials, guidelines, commentaries, etc. It also excludes articles with fewer than 20 citations and, for now, is limited to articles written in English.

Posted by: Sean Henderson, Software Engineer

Organizing your Scholar library

Google Scholar Library allows you to build your personal collection of articles within Scholar. You can save articles right from the search page, organize them with labels, and use the power of Scholar's full-text search & ranking to quickly find just the one you want. You decide what goes into your library and we provide all the goodies that come with Scholar search results - up to date article links, citing articles, related articles, formatted citations, links to your university’s subscriptions, and more.

As personal libraries have grown over time,  managing them takes more effort. Today we are making organizing your library easier by making it possible to update or export multiple articles with a single click. For example, if you are writing a new paper, you can quickly export the articles to cite to your favorite reference manager; if you are grouping papers that explore different aspects of your research area, you can select all papers in a sub-field and label them with one click.



If you don’t yet have a library,  it is easy to create one.

Posted by: Deepak Jindal, Senior Staff Engineer

Organizing your Scholar library

Google Scholar Library allows you to build your personal collection of articles within Scholar. You can save articles right from the search page, organize them with labels, and use the power of Scholar's full-text search & ranking to quickly find just the one you want. You decide what goes into your library and we provide all the goodies that come with Scholar search results - up to date article links, citing articles, related articles, formatted citations, links to your university’s subscriptions, and more.

As personal libraries have grown over time,  managing them takes more effort. Today we are making organizing your library easier by making it possible to update or export multiple articles with a single click. For example, if you are writing a new paper, you can quickly export the articles to cite to your favorite reference manager; if you are grouping papers that explore different aspects of your research area, you can select all papers in a sub-field and label them with one click.



If you don’t yet have a library,  it is easy to create one.

Posted by: Deepak Jindal, Senior Staff Engineer

2016 Scholar Metrics Released


Scholar Metrics provide an easy way for authors to quickly gauge the visibility and influence of recent articles in scholarly publications. Today, we are releasing the 2016 version of Scholar Metrics. This release covers articles published in 2011–2015 and includes citations from all articles that were indexed in Google Scholar as of June 2016.

Scholar Metrics include journal articles from websites that follow our inclusion guidelines, selected conference articles in Computer Science & Electrical Engineering and preprints from arXiv and NBER. Publications with fewer than 100 articles in 2011-2015, or publications that received no citations over these years are not included.

You can browse publications in specific categories such as Food Science & Technology, Sustainable Energy, or Public Health as well as broad areas like Engineering & Computer Science or Humanities, Literature & Arts . You will see the top 20 publications ordered by their five-year h-index and h-median metrics. You also can browse the top 100 publications in several languages - for example, Portuguese and Spanish. For each publication, you can view the top papers by clicking on the h5-index.

Scholar Metrics include a large number of publications beyond those listed on the per-category and per-language pages. You can find these by typing words from the title in the search box, e.g., [journalism], [saúde], [genes].

In this release, we have added per-language pages for five new languages - Russian, Korean, Polish, Ukrainian, and Indonesian.

For more details, see the Scholar Metrics help page.

Posted by: Anurag Acharya, Distinguished Engineer

Query suggestions to help explore new topics


As a graduate student, I often had to find and read papers for my courses - usually in areas that I wasn't familiar with. Google Scholar had already made it possible to find papers in all areas of research and the key challenge was to find the right keywords to search for. And then, when I joined the Scholar team, I had to quickly come up to speed with yet more research fields.

Today, we are launching query suggestions to help users explore topics they may not be familiar with. When you do a query, the results page may also include related search queries to help you  explore different directions within your topic of interest. Query suggestions appear after search results.

For example, see  [antiparkinson]. As Wikipedia mentions, antiparkinson medications are used to treat/relieve the symptoms of Parkinson's disease. The suggested queries span several directions: Query suggestions span all broad areas of research. For example, see [gps antenna], [prions], [vaccination], [drug-eluting stents], [estoppel], [conformal field theory], [distributed database], [optimal stopping problem].

I wish I had access to something like this when I started working on query suggestions. Being able to quickly explore topics like [collocations], [language model] and [syntactic parsing] would have helped quite a bit...

As yet, query suggestions are available for selected English queries. We plan to expand the coverage to more languages and queries.

Posted by: Namit Shetty, Software Engineer

Quickly lookup references


As a graduate student and then a faculty member, I spent many a day trying to find references I had seen in articles. Tracking down each reference and then a copy of it that I was able to read often took several steps. With many a slip twixt the cup and the lip.

To help researchers quickly lookup references, Scholar now automatically identifies queries that are likely to be looking for a specific paper. For such queries, it tries hard to find the intended paper and a version that that particular user is able to read. You can lookup full references, e.g.:

King CY, Diaz-Avalos R (2004) Protein-only transmission of three yeast prion strains. Nature 428: 319–323.

Wong PC, Pardo CA, Borchelt DR, Lee MK, Copeland NG, Jenkins NA, Sisodia SS, Cleveland DW, Price DL (1995) An adverse property of a familial ALS-linked SOD1 mutation causes motor neuron disease characterized by vacuolar degeneration of mitochondria. Neuron 14:1105–1116.

Accetta, F. S., Zoller, D. J., & Turner, M. S. 1985, Phys. Rev., D31, 3046

Watanabe, Y., & Komatsu, E. 2006, Phys. Rev. D, 73, 123515

You can lookup article titles:

Emerald: A general‐purpose programming language

Compelling transgenetic evidence for transmission of bovine spongiform encephalopathy prions to humans

If all you remember is some of the authors and words from the title, that works in many cases too:

einstein rosen podolsky 1935

riedel gibson active disks

You can cut-and-paste references, type what you remember of the paper, or better still use the Scholar Button (available for Chrome, Firefox and Safari).

The astute reader has no doubt already figured out that this feature can be embedded on other web sites and can be used by libraries, publishers, teachers and others to help their own readers and students track down scholarly articles. To construct a Scholar lookup URL for an article title or a full reference, URL-escape the text and append it to https://scholar.google.com/scholar?q=. E.g., here is a link to one of our recent articles:

https://scholar.google.com/scholar?q=On+the+shoulders+of+giants%3A+The+growing+impact+of+older+articles.

We would like to thank Cliff Chiung Yu Lin for his contributions in making this feature possible.

Posted by: Anurag Acharya, Distinguished Engineer

Blast from the past: reprint request postcards

Recently, I spent a few days organizing my uncle's papers. He was a graduate student in the 60s and a faculty member for the rest of his life. Going over his papers was like walking through the history of scholarly communication. One of the fascinating things I found were pre-printed postcards for requesting article reprints.

Each institution printed these postcards for its researchers. They included the institution address and a template request. To request a reprint, you would fill in the address of the author and some information about the paper you were interested in and drop it in mail. And hope for a response in six to ten weeks. Here are a couple of requests that my uncle received.





Much has changed since those days. Journal archives have moved online and email zips across the world in seconds. It is hard to imagine today how researchers of the day moved the mountains that they did.

Posted by: Anurag Acharya, Software Engineer

Using Google Scholar in Scholarly Workflows

The next article in our 10th Anniversary Series is by Prof. Jonathan Eisen. He is at the University of California, Davis with appointments in the Department of Medical Microbiology and Immunology in the School of Medicine and the Department of Evolution and Ecology in the College of Biological Sciences. His research focuses on communities of microbes and how they provide new functions – to each other or to a host. He is coordinating the largest microbial sequencing project to date – a Genomic Encyclopedia – being done at the DOE Joint Genome Institute where he holds an Adjunct Appointment. He is also an active and award-winning blogger and microblogger.-- Anurag Acharya


Using Google Scholar in Scholarly Workflows

Jonathan Eisen
School of Medicine & College of Biological Sciences, UC Davis

When Anurag Acharya asked me recently if I would be interested in writing a guest post for the Google Scholar blog in relation to the 10th anniversary of Google Scholar, I immediately responded "Yes." Literally, that was the full content of my email response to his request. Why did I answer so enthusiastically? Well, I can put this down to three main reasons:
So - in thinking about what to write for this post, I came up with three main topics I thought would be good to cover - how I got interested in topics of searching for and sharing scholarly papers, how I use Google Scholar, and some ideas about future possible uses of Google Scholar.

Part 1: Some Background


One day, in ancient history, my wife came home from work (at a biotech startup up focusing on bioinformatics) raving about this new search engine "Google" that people at her company were talking about. As someone who thought of himself as on the cutting edge of web technology, I was a bit dismayed that I had not somehow discovered this myself. But I got over that and tried it out. And, after searching for my name (and being impressed with how well this new search engine worked on such an important topic) I immediately started playing around with searching for scientific papers and data. I did this, I guess, because ever since I was in college, I had been becoming more and more interested in (or some would say obsessed with) issues relating to finding and sharing scientific knowledge.

Without going into too much detail, some of the factors that contributed to my obsession included:
  • Working as a shelver and then assistant in the Museum of Comparative Zoology library in college and seeing how people struggled to find papers of relevance to their work;
  • Spending many years in graduate school (in the 1990s) working on projects that had been largely unstudied since the 1960s, including one (so called adaptive mutation) where researchers claimed to have discovered something new in the 1990s but had in fact missed a rich literature on the topic from the 1950s and 1960s (e.g., see this from 1955).
  • Building and sharing databases where I was trying to include a description of every paper that had been published about specific genes. I note - thanks to the Wayback machine my Stanford website from when I was a PhD student is still available - although alas the specific linked databases are not. I have reposted some of them for people to see what they were like (though many of the links in them are busted). See for example my sites on RecA, SNF2, MutS and more
  • Working on projects to catalog everything known about specific organisms in association with work I was doing to characterize the genomes of these organisms
In these and other projects, I had seen and experienced just how much time could be spent on searching for papers and data about a particular topics. I am not sure I had a well-defined strategy in every case but I came to rely upon some preferred methods including:
  • "Citation walking" where one takes a paper of interest and then asks "how has this paper been cited?” and traverses across the literature via citations
  • Searching for keywords in abstracts and titles
  • Browsing through specific journals
  • Looking for papers by specific authors
  • For data, I mostly would look in specific centralized data repositories such as Genbank for DNA sequence information and PDB for three-dimensional structural data on proteins.
And of course many other approaches. Nothing really novel or brilliant here though I do think I got pretty good at how to carry out such searches. But one of the challenges was each approach had to be done in a different system and some of the systems were only available for a fee and some were not even online. And even with lots of time and pain, many things could be missed.

Thus when my wife introduced me to this new fangled Google thing my thoughts rapidly turned to - how can I use this new tool to help in finding and then sharing scientific papers or data about these genes and organisms I was studying? Did Google searches solve all my "issues" in this regard? Alas, no. But jump forward ~15 years to today and I am quite amazed in retrospect how much of my scholarly workflow flows through Google Scholar. But rather than try to recall and write about how my workflow changed with the advent of Google Scholar I thought I would just jump to the present time and discuss some ways that I use Google Scholar now.

Part 2: Using Google Scholar today


When working on this post I started to look around at how I use Google Scholar and I confess I was amazed at how many different ways I use it in my work. Here are some examples:

Tracking and using citations. One major general use of Google Scholar lies in tracking of citations to specific scholarly works. Here are some ways that I use such information:
  • Citations to individual works. A key aspect of scholarly work in many fields is examining how specific works are cited. Such information has many uses include discovering new works on a topic by seeing how specific papers from the past are cited, assessing impact of works, ego satisfying, and more. For many years, information on how a specific work was cited was nearly impossible to come by without paying for access to citation tracking databases. Now, with Google Scholar I (and others) can very rapidly gather such information.
  • Citation from diverse sources. One aspect of using Google Scholar to track citations to individual works is the way GS finds citations in diverse sources – not just in the peer reviewed scholarly literature. Now, in some ways this can be viewed as a limitation (some may not want to count or even know about citations from self published white papers, for example). But in others ways this is a wonderful thing as one can find citations to one’s work from very diverse sources outside of the “normal” mold.
  • Citation metrics. It is not a large conceptual leap to go from the ability to track citations to individual works to the ability to create summary statistics about citations across many works. There are many indices for such purposes – some useful and some not. But whatever you think of such indices – Google Scholar has opened up the ability for people to calculate such metrics for oneself or to offer services to calculate metrics for others. Such indices can be used in many ways but perhaps the most common is to summarize the citations for one individual researcher. Which leads into my next topic …
Google Scholar Pages. Perhaps my favorite development from Google Scholar in the last 10 years has been the introduction of Google Scholar Pages for individuals. I make use of my Google Scholar page and pages of others for dozens of things including these:
  • Citation metrics for myself. See above for a discussion of citation metrics in general. I use Google Scholar pages to examine citation metrics for myself and my papers all the time (right now GS shows two summary statistics H-index and I-10 index). And I use this information in many ways including putting it on my CV, including it in grant reports, and examining which of my scholarly works have had more “impact”.
  • As landing page for my publication list. Once one has a GS page, GS automatically adds new publications to one’s list and also updates citation counts and other information regularly. Thus I now include a link to my GS page on my blogs, my work web sites, and in my email signature.
  • To keep track of my coauthors. I have been blessed (and perhaps a bit cursed) to work in a field (genomics) where many projects involve large-scale collaborations across many institutions, involving many researchers. And I have found that a nice way to track these coauthors is via GS (although – note to GS folks – there used to be a way to show, publically, all coauthors in a list but I cannot seem to figure out how to do this anymore).
  • Author disambiguation. For people like myself with a relatively unique name, when others search for my scholarly works, they are pretty easy to find (although I note the fact that there is another Jonathan Eisen out there who publishes some works with a bit of a conspiracy theory angle has been both good and bad for me at times). But for many others, their name is not a perfect way to find their work. This may be because they have a name that is relatively common, or it may be because they have changed their name (e.g., after marriage). For such people creating a GS page can be very useful because once one trains GS with a set of works, it can find new works by that same person quite well (I first found out about this author disambiguation by GS when Anurag gave a talk at a meeting I organized last year). GS is certainly not the only tool in author disambiguation and others – like author UIDs (e.g., ORCID) are almost certainly better long term options. I note – author disambiguation may seem like a esoteric topic to many but it has major implications on important issues such as gender equity in academia, since women are much more likely to change their names during their career than men are.
  • Automated updates of new papers by specific authors. One option associated with GS author pages I use extensively is the ability to “follow” specific authors and get notified of new publications of theirs.
  • To keep track of a collection of people. Most researchers do not regularly update their individual publication pages on their websites. However, if those researchers have GS pages one can keep track of their new papers quite easily (either by the follow option mentioned above or just by browsing occasionally). For example, for my microBEnet project I curate a list of GS pages for researchers in the whole field with connections to studies of “microbiology of the built environment” and thus (hopefully) help others keep up with what is going on in the field.
  • Who is in a specific field? One feature of GS author pages that is not used a lot as far as I can tell, but which has some value is the “areas of interest” tag one can add to one’s profile. Though not everyone uses such tags, I have found they are a useful tool in finding researchers working on specific topics. For example, I list “symbiosis” as one of my areas of interest and if I click on the link for that on my page I get a list (sorted by citation counts – which is both useful and annoying) of others who have listed that same area of interest. And many of the people in this list I am not familiar with yet they do work on topics in which I am very interested.
Automated discovery of new papers by topic. Pretty much all scholars these days are drowning in information and in keeping up with scholarly works. There are many reasons for this of course, and there are also some solutions. I find, for example, that social media is a great way to keep up to date on what new papers are coming out or have come out recently. But social media does not find everything and as someone who is responsible for keeping others up to date on various fields (e.g., this is one of my jobs at microBEnet) I also rely on both manual and automated searchers of the scholarly literature to find new papers or old papers I have missed. GS has two key ways to help in this regard. The first is relatively simple in concept but takes advantage of the power of Google indexing – which is just directly searching GS for papers on particular topics. And the advanced search options allow some customization of such searches. But as someone who is quite busy, I do not actually end up searching GS for new papers all that often. Instead I rely upon automated searches through various services including Pubmed, Pubchase, and GS. I use GS in two ways for such automated searches:
  • Create an alert. When one does a search on GS, in addition to results one is presented with an option to “Create an alert”. I now have dozens of such alerts in operation. To avoid getting drowned by the results I set them up to send only once a week and I filter them into a separate mail folder that I only look at when I have time. But I frequently find interesting new papers this way.
  • GS Updates. Another option now available, if one has a GS profile, is to use the GS Updates system (which I have written about before here and here for example). This system uses one’s publication list to scan for new papers that are related in some way to one’s prior work.
Many other uses of GS. I have gone on perhaps way too long here so I am only going to briefly mention a few other uses of GS.
  • Finding online versions of papers. Unquestionably one of the most valuable uses of GS is to find online versions of scholarly works. But since others have written extensively about this I will just say the following: if you publish any scholarly work I recommend you make it freely and openly available AND that you make sure that it gets indexed by GS.
  • Full text searches of the literature. Another critically important aspect of GS is that it facilitates full text searching of the scholarly literature which is important for many reasons.
  • Finding works outside of the “normal” places to publish. Another key feature of GS is that it indexes much more than just publisher’s sites. If one posts a preprint on one’s own web server, that paper may show up in GS (which I think is a good thing). GS also indexes many diverse sources of scholarly works and thus helps in finding works that may otherwise not see the light of day.

Part 3: Where do we go from here?


As an active user of Google Scholar I of course have many comments, complaints, ideas and thoughts about what it could do better and where it might go in the future. And there are SO many things that could be added or improved upon – things like better figure and table searching, better exporting of information, better abilities to curate and create collections and to then use such collections as training sets for automated searchers, and more and more and more. I have written about some such issues and suggestions from time to time in my blog (see for example, this and this and this). There is certainly lots of work to be done.

But in thinking about this I realized that making a list of issues and suggestions is only of limited value. What I think GS really needs is a better public forum where GS can discuss what their plans are for the future and also where users and developers can discuss what they would find useful. And though I see some places for such discussions on the Google Scholar blog and in related sites, I don’t see a lot. So – I would like to end with a call for GS to create a better site for such discussions of the future of GS …