Frequently Asked Questions
- How to update my interests?
Users can always re-select the interests. In the Signup/Update Form, select your new interests and type in the email and password that you used when you first registered the service. Then click “Subscribe” button. You will receive the updated digests the next business day.
- How to check service status?
You can check the service status using the following button:
- I forgot my password, how to recover it?
Password can be recovered using the following button:
- How to change my current password?
Password can be changed using the following button:
- I do not want to receive daily digest emails any more but still like to receive conferences digests and use paper/patent search functions, what can I do?
You can disable/re-enable the daily digest service using the button shown below. If daily digest service is disabled, you will not receive daily digest emails any more. You can still read daily digests through our console. The service can be easily re-enabled using the same link.
- How to cancel the service?
You can cancel the service any time using the following button:
- I can see daily paper digests using the console, but never received any daily digest email, what happened?
Probably when you registered the service, you chose to disable the daily digest service. You can re-enable the service using the following button:
It is also possible that emails sent to you are put in spam folder, or blocked by your email service providers. You can mark digests as “not spam”, and whitelist us.
- I do not receive daily digest emails on some weekdays, what happened?
There are three possible reasons:
(1) if the areas you signed up are not very active, then it is normal that you do not receive updates everyday.
(2) The paper websites that we are tracking (like arxiv.org) may not release new papers in time due to holidays or outages. Once their services are back to normal, paperdigest.org will send out digest emails as soon as possible (usually the next day). Such scenarios do not happen very often (1-2 times per year).
(3) Emails sent to you have been blocked by your email service provider. If emails sent to a user are bounced back multiple times, we will have to disable the associated account. If a large number of emails sent to a domain are bounced back, we will have to limit #emails we can send to that domain. We notice that users using emails from qq.com and 163.com experience this problem more frequently than the others.
(4) Universal Solution: Users can login to the console and read digests under Digest/Daily Digest, which provides the same contents as email sent outs.
- I get digests for more than 200 papers everyday, and I cannot read all of them. How can I receive fewer?
This is the #1 complaint we receive from our daily digest subscribers.
Root Causes: Some areas are active like ‘cs.AI’, so it is expected to see a lot of new papers everyday when multiple such active areas are selected. The problem gets worse in January-March, since deadlines of many conferences fall in this time range.
Solutions: We recommend four solutions:
(1) We always encourage users to narrow down the selected interests (see How to change my interests?). With fewer interests selected, users get fewer updates.
(2) Papers in daily emails are sorted by primary_category, readers can decide what papers to read based on categories if he/she does not have time to read all.
(3) We have a console for users to filter out papers that they are not interested in. Users can login the console, and then click on /Digest/Daily Digest. This returns a list of links to all daily digests in the last two weeks. The contents here are the same as what we sent out in daily digest emails. For each day, one can choose to read new papers only, updates to previous papers, or both. Users can also sort papers based on columns and type in a keyword in “filter” field to only show papers with the desired keyword.
(4) Users can suspend daily digest service using this link. When daily digest service is suspended, users will not receive daily digest emails, but still have access to the daily digests through console and be able to receive other updates like conference digest notifications. Users can always re-enable this service using the same link.
- What is the difference between Daily Digest and Topic Tracking?
Daily digest emails are generated based on user selected categories. There are more than 200 such categories in total, like artificial intelligence, computer vision. Topics in topic tracking typically have a much narrower coverage compared to user selected categories. We currently track ~60 topics, covering trending topics in Biology/Health, Computer Science, Finance, Math, and Physics. Depending on the demand, we may frequently add new topics and remove old topics. The categories in daily digests do not change very often.
- In “Best Paper” digests, how are the most influential papers selected?
“Best Paper” digests are created for top conferences/journals. All papers published on a given year are ranked based upon #paper citations + #patent citations. The ranking lists are also updated regularly to reflect the most recent changes.
- I have a paper with a lot of citations, but it does not appear in your “Best Paper” digests, may I know why?
There are several possible reasons.
(1) the papers in best paper digests have more citations.
(2) our system does not have all citations that you are aware of or our system does not associate citations to the right paper. For example, our system may fail to match a paper and its citations, when the paper title or author names contain foreign characters.
(3) feel free to let us know if you find any problem, we will look into it and get the problem fixed. All changes will go into the next updates.
- Precisely, what is a “highlight”?
By our definition, a highlight is a sentence that can immediately tell readers what the paper is about. With such highlights, readers should be able to quickly browse a large number of papers, keep up with the most recent work and find the papers that they like to focus on.
- What is “IF” and how is it calculated?
“IF” stands for impact factor. It is a score in 1.0-10.0, calculated based on paper citations, patent citations, etc. A higher value indicates a broader impact. This is not the score used to rank “most influential papers”, which is ranked based on a much simpler metrics: #paper_citations + #patent_citations.
- What does the number in your daily digest email title represent, for example, 0151832 in “Paper Digest-2020.09.08 0151832”?
It is a random number we purposely append to email title. As many of you know, some email service providers may block an email sender if that sender sends out a large number of emails with the same contents (bulk emails). Paper Digest has a large number of daily digest subscribers, and needs to send out a lot of emails on a daily basis. Even though our contents are customized for each individual and thus different, the email titles used to be the same (e.g. “Paper Digest- 2020.09.08”) for all subscribers. To lower the chance that our emails will be marked as spam or rejected, we append a random number to each email title to make titles look different as well.
- Do you send out digests every day?
Digests are sent out Monday-Friday.
- Who is running paperdigest.org?
We are a group of researchers and engineers working on machine learning and natural language processing.
- What services do you provide?
We currently offer two paper digest services: daily paper digest service (introduced in early 2018 to serve subscribed users) and conference digest service (introduced in mid 2019 to serve public users). We also offer search services (paper, patent, grant, person) and real-time topic tracking services.
- What is the advantage of using your paper search service compared to using other academic search sites?
There are a couple of advantages: (1) Most of our search results come with highlight sentences. This helps users quickly decide if a paper is worth reading or not. We are the first to provide this feature in industry, and able to offer high quality highlights for all subjects. (2) Every paper in our result list is associated with related papers, patents, grants, experts and organizations. This feature is unique in industry. (3) Author profiles of each paper are also provided. Users can click on author names and see their recent papers, grants and patents.
- What technologies are you using to generate the paper digest and search results?
We have a fairly complicated text analysis platform to support paper tracking, search and summarization services. The platform contains (1) a central knowledegebase storing tens of millions of documents providing paper and patent search results; (2) a number of site crawlers tracking new papers and other related documents in real-time; (3) a pipeline with built-in natural language processing and machine learning components to analyze documents. Internally, we have several different algorithms (from rule-based to transformer-based) for document analysis. We have been improving them over time with the most recent technologies.
- Can you explain how paper highlights are generated in detail?
Given a paper, our system first processes the paper to get the parse results. Then a set of candidate generation components start working on the parse results to generate highlight candidates. Different generation components are based on different strategies. For example, any single sentence in the paper can be generated as a candidate; adjacent sentences can be merged to a candidate; sentences connected by co-references can also be formed as candidates, etc. We usually generate a lot of candidates for each paper. After candidates are created, a set of scoring components will be applied to score each candidate from different perspectives. For example, some scorers are rule-based to check if the candidate has the desired patterns; some are deep learning based scoring candidates based on pre-trained neural networks; some model the location of candidates (from abstract, from conclusion, etc.); some check if the candidate aligns with the title well or not. We have dozens of such candidate generation and scoring components. Techniques used in some of these components are exclusive. Together these components assign each candidate with a set of features. In the last step, a final ranking model ranks all candidates using the features created in the previous steps and produces a confidence score for each candidate. We return the top candidate as the paper highlight if the confidence score is above a threshold. Depending on applications and data availability, we can choose what components to integrate in the pipeline.
- Are you using deep learning in your system?
Some of our components are based on neural networks. They are integrated in our pipeline system.
- Where do your highlight sentences come from?
This depends on what the input is. The input can be as long as a full paper or as short as title plus abstract. Highlight sentences can be created from anywhere in the input. Most of the time, we use title + abstracts as our input.
- How to contact paperdigest.org?
All other questions, requests and comments can be sent to email@example.com.