titles of papers; | |
parameter |
|
word branches. | |
1: | |
2: | let |
3: | |
4: | predict |
5: | |
6: | generate directed edges from |
7: | use |
8: | |
9: | |
10: |
word branches; | |
cropped word branches. | |
1: | calculate the token-paper matrix ( |
2: | calculate the tf-idf matrix ( |
3: | |
4: | rank tokens according to { |
5: | |
6: | crop the tokens of { |
7: | crop the edges connecting those tokens; |
8: | |
9: | crop the tokens of { |
10: | crop the edges connecting those tokens. |
11: | |
12: |
titles and abstracts of papers; | |
list of token lists. | |
1: | stem words using the PorterStemmer of NLTK2; |
2: | remove stopwords using the stopword corpus of NLTK; |
3: | remove the words that appear in less than |
1: | model=get model(token num=max(len(source token dict), len(target token dict)), embed dim=32, encoder num=4, decoder num=4, head num=4, hidden dim=32, dropout rate=0.05, use same embed=False,) |
2: | model.compile(‘adam’, ‘sparse categorical crossentropy’) |
3: | model.fit(x=[np.array(encode input∗30), np.array(decode input∗30)], y=np.array(decode output∗30), epochs=5, batch size=32,) |
Time | a | b | c | d | e | f |
---|---|---|---|---|---|---|
1999 | 2,475 | 3,274 | 95,021 | 0.11 | 2.371 | 0.998 |
2000 | 2,380 | 3,347 | 93,910 | 0.101 | 2.395 | 0.998 |
2001 | 2,455 | 3,477 | 108,954 | 0.108 | 2.355 | 0.999 |
2002 | 2,812 | 3,710 | 117,269 | 0.094 | 2.272 | 1.0 |
2003 | 2,656 | 3,592 | 115,019 | 0.1 | 2.312 | 0.999 |
2004 | 2,955 | 3,919 | 138,451 | 0.101 | 2.299 | 0.999 |
2005 | 3,131 | 4,084 | 154,041 | 0.099 | 2.275 | 0.999 |
2006 | 3,248 | 4,260 | 166,614 | 0.1 | 2.289 | 0.999 |
2007 | 3,419 | 4,368 | 184,420 | 0.102 | 2.279 | 0.999 |
2008 | 3,408 | 4,436 | 184,881 | 0.104 | 2.304 | 1.0 |
2009 | 3,658 | 4,609 | 212,771 | 0.098 | 2.218 | 1.0 |
2010 | 3,639 | 4,668 | 221,090 | 0.1 | 2.204 | 0.999 |
2011 | 3,462 | 4,688 | 220,020 | 0.111 | 2.228 | 0.999 |
2012 | 3,621 | 4,875 | 209,517 | 0.114 | 2.28 | 1.0 |
2013 | 3,593 | 4,846 | 231,959 | 0.096 | 2.189 | 1.0 |
2014 | 3,334 | 4,679 | 210,099 | 0.096 | 2.208 | 1.0 |