¿­·¢Óé·¢K8com

¿­·¢Óé·¢K8com¿Æ¼¼

¿­·¢Óé·¢K8com

EMNLP2021 | ¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÉÙÁ¿Êý¾Ý¹Øϵ³éÈ¡ÂÛÎı»Â¼ÓÃ

2021-11-09

½üÈÕ£¬EMNLP 2021ÔÚ¹ÙÍøÌáÇ°¹«²¼Á˽ñÄêµÄÂÛÎÄÉó¸å½á¹û£¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÂÛÎÄ¡¶MapRE: An Effective Semantic Mapping Approach for Low-resource Relation Extraction¡·±»Â¼Ó᣸ÃÂÛÎÄÌá³öÁËÔÚµÍ×ÊÔ´¹ØϵÌáÈ¡ÈÎÎñÖÐÈÚºÏͬÀà±ðÑù±¾¼ä¾ä×ÓÏà¹ØÐÔÐÅÏ¢ºÍ¹Øϵ±êÇ©ÓïÒåÁ½¸ö·½ÃæµÄÐÅÏ¢µÄ·½·¨£¬²¢ÔÚ¶à¸ö¹ØϵÌáÈ¡ÀàÈÎÎñµÄ¹«¿ªÊý¾Ý¼¯µÄʵÑéÖеõ½ÁËSOTA½á¹û¡£

undefined
EMNLP£¨È«³ÆConference on Empirical Methods in Natural Language Processing£©Êǹú¼Ê×ÔÈ»ÓïÑÔ´¦Àí¶¥¼¶»áÒ飬ÓÉACL SIGDATÖ÷°ì£¬Ã¿Äê¾Ù°ìÒ»´Î£¬ÔÚGoogle Scholar¼ÆËãÓïÑÔѧ¿¯ÎïÖ¸±êÖÐÅÅÃûµÚ¶þ£¬Ö÷Òª¹Øעͳ¼Æ»úÆ÷ѧϰ·½·¨ÔÚ×ÔÈ»ÓïÑÔ´¦ÀíÁìÓòµÄÓ¦Ó᣽ü¼¸ÄêËæ×Å´ó¹æÄ£Êý¾ÝµÄ»úÆ÷ѧϰ·½·¨µÄ·¢Õ¹£¬¸Ã»áÒéÈËÊýÖðÄêÔö¼Ó£¬Êܵ½Ô½À´Ô½¹ã·ºµØ¹Ø×¢¡£


EMNLPÂÛÎÄÈëÑ¡±ê×¼¼«ÎªÑϸñ£¬EMNLP 2021¹²ÊÕµ½ÓÐЧͶ¸å3114ƪ£¬Â¼ÓÃ754ƪ£¬Â¼ÓÃÂʽöΪ24.82%¡£°´ÕÕ¹ßÀý£¬EMNLP 2021ÆÀÑ¡ÁË×î¼Ñ³¤ÂÛÎÄ¡¢×î¼Ñ¶ÌÂÛÎÄ¡¢½Ü³öÂÛÎĺÍ×î¼ÑDemoÂÛÎÄËÄ´ó½±Ï¹²7ƪÂÛÎÄÈëÑ¡¡£


½ñÄêEMNLP 2021 ½«ÓÚ11ÔÂ7ÈÕ - 11ÈÕÔÚ¶àÃ×Äá¼Ó¹²ºÍ¹úÅîËþ¿¨ÄɺÍÏßÉÏÁªºÏ¾Ù°ì£¬»áÒéΪÆÚÎåÌ죬¸´µ©´óѧ¼ÆËã»ú¿ÆѧѧԺ½ÌÊÚ»ÆÝæݼ½«µ£Èα¾´Î»áÒéµÄ³ÌÐòÖ÷ϯ¡£ÔÚ¼´½«ÕÙ¿ªµÄEMNLPѧÊõ»áÒéÉϽ«Õ¹Ê¾×ÔÈ»ÓïÑÔ´¦ÀíÁìÓòµÄÇ°ÑØÑо¿³É¹û£¬ÕâЩ³É¹ûÒ²½«´ú±í×ÅÏà¹ØÁìÓòºÍ¼¼Êõϸ·ÖÖеÄÑо¿Ë®Æ½ÒÔ¼°Î´À´·¢Õ¹·½Ïò¡£

¿­·¢Óé·¢K8comDeepBlueAIÍŶӵÄÂÛÎÄÌá³öÁËÔÚµÍ×ÊÔ´¹ØϵÌáÈ¡ÈÎÎñÖÐÈÚºÏͬÀà±ðÑù±¾¼ä¾ä×ÓÏà¹ØÐÔÐÅÏ¢ºÍ¹Øϵ±êÇ©ÓïÒåÁ½¸ö·½ÃæÐÅÏ¢µÄ·½·¨£¬²¢ÔÚ¶à¸ö¹ØϵÌáÈ¡ÀàÈÎÎñµÄ¹«¿ªÊý¾Ý¼¯µÄʵÑéÖеõ½ÁËSOTA½á¹û¡£

¹ØϵÌáÈ¡Ö¼ÔÚ·¢ÏÖ¸ø¶¨¾ä×ÓÖÐÁ½¸öʵÌåÖ®¼äµÄÕýÈ·¹Øϵ£¬ÊÇNLPÖеÄÒ»Ïî»ù±¾ÈÎÎñ¡£¸ÃÎÊÌâͨ³£±»ÊÓΪÓмලµÄ·ÖÀàÎÊÌ⣬ÓÉ´ó¹æÄ£±ê¼ÇÊý¾Ý½øÐÐѵÁ·¡£½üÄêÀ´£¬¹ØϵÌáÈ¡Ä£Ð͵õ½ÁËÃ÷ÏԵķ¢Õ¹¡£È»¶ø£¬ÑµÁ·Ñù±¾¹ýÉÙʱ£¬Ä£ÐÍÐÔÄܻἱ¾çϽµ¡£

ÔÚ×î½ü¹¤×÷ÖУ¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÀûÓÃСÑù±¾Ñ§Ï°µÄ½ø²½À´½â¾öµÍ×ÊÔ´ÎÊÌâ¡£ÉÙÑù±¾Ñ§Ï°µÄ¹Ø¼ü˼ÏëÊÇѧϰһ¸öÓÃÀ´±È½ÏqueryºÍsupport set samplesÖÐÑù±¾ÏàËƶȵÄÄ£ÐÍ£¬ÕâÑù£¬¹Øϵ³éÈ¡µÄÄ¿±ê´Óѧϰһ¸öͨÓõÄ¡¢×¼È·µÄ¹Øϵ·ÖÀàÆ÷±äΪѧϰһ¸ö½«¾ßÓÐÏàͬ¹ØϵµÄʵÀýÓ³Éäµ½Ïà½üÇøÓòµÄÓ³ÉäÄ£ÐÍ¡£

ÔÚÉÙÑù±¾Ñ§Ï°µÄÉ趨Ï£¬±êÇ©ÐÅÏ¢£¬¼´°üº¬¹Øϵ±¾ÉíÓïÒå֪ʶµÄ¹Øϵ±êÇ©£¬ÔÚѵÁ·ºÍÔ¤²âʱ²¢Ã»Óб»Ä£ÐÍÓõ½¡£¿­·¢Óé·¢K8comDeepBlueAIÍŶӵÄʵÑé½á¹û±íÃ÷£¬ÔÚԤѵÁ·ºÍ΢µ÷ÖнáºÏÉÏÊö±êÇ©ÐÅÏ¢ºÍ¸÷¹ØϵÀà±ðµÄÑù±¾Á½ÀàÓ³Éä¿ÉÒÔÏÔ×ÅÌá¸ßÄ£ÐÍÔÚÉÙÑù±¾¹ØϵÌáÈ¡ÈÎÎñÉϵıíÏÖ¡£



01

ÓïÒåÓ³ÉäԤѵÁ·
undefined
ԤѵÁ·²¿·ÖµÄÄ¿±êº¯ÊýÓÉÈý¸ö²¿·Ö×é³É£º



CCR: Ñù±¾±íʾ¼äËðʧ
undefined
CRR£ºÑù±¾Óë±êÇ©¼äËðʧ

undefined

MLM£ºÓïÑÔÄ£ÐÍËðʧ£¬Í¬BERT
undefined
¿­·¢Óé·¢K8comDeepBlueAIÍŶӲÉÈ¡ÀàËÆCP (Peng et al., 2020)µÄ·½·¨ÖжÔÄ£ÐͽøÐÐԤѵÁ·¡£²»Í¬Ö®´¦ÔÚÓÚÍŶӻ¹¿¼ÂÇÁ˱êÇ©ÐÅÏ¢£¬Ê¹ÓÃWikidata×÷ΪԤѵÁ·ÓïÁϿ⣬ȥ³ýÁËWikidataºÍDeepBlueAIÍŶÓÓÃÓÚºóÐøʵÑéµÄÊý¾Ý¼¯Ö®¼äµÄÖظ´²¿·Ö¡£


±¾²¿·ÖÖУ¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓʹÓÃBERT base×÷Ϊ»ù´¡Ä£ÐÍ£¬²ÉÓÃAdamWÓÅ»¯Æ÷£¬×î´óÊäÈ볤¶ÈÉèÖÃΪ60¡£¿­·¢Óé·¢K8comDeepBlueAIÍŶӹ²ÑµÁ·ÁË11,000²½£¬ÆäÖÐÇ°500²½Îªwarmup£¬batch sizeÉèΪ2040£¬Ñ§Ï°±ÈÂÊΪ3e-5¡£



02

¼à¶½ÐÔ¹Øϵ³éÈ¡

±¾²¿·Ö¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÒ»¹²ÊÔÑéÁËMapREԤѵÁ·Ä£Ð͵ÄÁ½ÖÖʹÓ÷½Ê½£¬¼´MapRE-L£¨Ö±½ÓʹÓÃÈ«Á¬½Ó²ã¶ÔÎı¾±àÂëÊä³öÔ¤²â¹Øϵ£©ºÍMapRE-R£¨²ÉÓùØϵ±àÂëÆ÷±àÂë¹Øϵ±êÇ©£¬ÔÙ×öÏàËƶÈÆ¥Å䣩£¬Ä£ÐͽṹÈçͼ£º
undefined

ÔڼලÐÔ¹Øϵ³éÈ¡ÈÎÎñÖп­·¢Óé·¢K8com¿Æ¼¼ÆÀ¹ÀÁ½¸ö»ù×¼Êý¾Ý¼¯£ºChemProtºÍWiki80¡£Ç°Õß°üÀ¨56,000¸öʵÀýºÍ80ÖÖ¹Øϵ£¬ºóÕß°üÀ¨10,065¸öʵÀýºÍ13ÖÖ¹Øϵ¡£

ʵÑé½á¹ûÈçÏ£º

undefined
ÕâÀï¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÖصã¹Ø×¢µÍ×ÊÔ´¹Øϵ³éÈ¡£¬Ñ¡È¡ÒÔÏÂÈý¸öÓдú±íÐÔµÄÄ£ÐͽøÐбȽÏ¡£


1)BERT£º¸ÃÄ£ÐÍÔÚÎı¾µÄͷʵÌåºÍβʵÌ岿·Ö·Ö±ðÔö¼ÓÌØÊâµÄ±ê¼Çtoken£¬ÔÚBERTÊä³öºó½Ó¼¸¸öÈ«Á¬½Ó²ãÓÃÓÚ¹Øϵ·ÖÀà¡£


2)MTB (Soares et al., 2019)£ºMTBÄ£ÐͼÙÉèÎ޼ලÊý¾ÝÖÐͷʵÌåºÍβʵÌåÏàͬµÄ¾ä×Ó¾ùΪÕýÑù±¾¶Ô£¬¼´¾ßÓÐÏàͬµÄ¹Øϵ¡£ÔÚ²âÊԽ׶Σ¬¶ÔqueryºÍsupport setµÄÏàËƶȵ÷ֽøÐÐÅÅÃû£¬½«µÃ·Ö×î¸ßµÄ¹Øϵ×÷ΪԤ²â½á¹û¡£


3)CP (Peng et al., 2020)£ºÍ¬MTBÀàËÆ£¬ÎÒÃǵķ½·¨Í¬CPÄ£Ð͵IJ»Í¬µãÔÚÓÚ£¬ÎÒÃÇÔÚԤѵÁ·ºÍ΢µ÷ʱ¾ù¿¼ÂÇÁ˱êÇ©ÐÅÏ¢¡£



ÎÒÃÇ¿ÉÒԹ۲쵽£º

1£©ÔÚBERTÉϽøÐÐԤѵÁ·£¨¼´MTB, CPºÍMapRE£©¿ÉÒÔÌá¸ßÄ£ÐÍÐÔÄÜ

2£©±È½ÏMapRE-LÓëCPºÍMTB£¬ÔÚԤѵÁ·ÆÚ¼äÌí¼Ó±êÇ©ÐÅÏ¢¿ÉÒÔÏÔ×ÅÌá¸ßÄ£ÐÍÐÔÄÜ£¬ÓÈÆäÊÇÔÚ×ÊÔ´¼«ÉÙµÄÇé¿öÏ£¬ÀýÈç½ö1%µÄѵÁ·¼¯ÓÃÓÚ΢µ÷

3) ±È½Ï MapRE-R ºÍ MapRE-L£¬ÆäÖÐÇ°ÕßÔÚ΢µ÷ÖÐÒ²¿¼ÂÇÁ˱êÇ©ÐÅÏ¢£¬±íÏÖ³ö¸üºÃ¸üÎȶ¨µÄʵÑé½á¹û


½á¹û±íÃ÷ÔÚԤѵÁ·ºÍ΢µ÷ÖÐʹÓñêÇ©ÐÅÏ¢¾ù¿ÉÏÔÖøÌá¸ßµÍ×ÊÔ´¼à¶½ÐÔ¹Øϵ³éÈ¡ÈÎÎñÉϵÄÄ£ÐÍÐÔÄÜ¡£



03

ÉÙÑù±¾ÓëÁãÑù±¾¹Øϵ³éÈ¡



ÔÚÉÙÑù±¾Ñ§Ï°µÄÇé¿öÏ£¬Ä£ÐÍÐèÒªÔÚÖ»Óиø¶¨Ò»¶¨¹ØϵÀà±ð£¬Ã¿¸öÀà±ðÉÙÊýÑù±¾µÄÇé¿öϽøÐÐÔ¤²â¡£¶ÔÓÚN way K shotÎÊÌ⣬Support set S°üº¬N¸ö¹Øϵ£¬Ã¿¸ö¹ØϵÓÐK¸öÑù±¾£¬²éѯ¼¯°üº¬Q¸öÑù±¾£¬Ã¿¸öÑù±¾ÊôÓÚ N ¸ö¹Øϵ֮һ¡£


¸ÃÄ£ÐͽṹÈçÏ£º
undefined

Ä£ÐÍÔ¤²â½á¹ûÓÉÏÂʽµÃ³ö£º
undefined

¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÔÚÁ½¸öÊý¾Ý¼¯ÉÏÆÀ¹ÀÌá³öµÄ·½·¨£ºFewRelºÍNYT-25¡£FewRel Êý¾Ý¼¯°üº¬70,000¸ö¾ä×ÓºÍ100¸ö¹Øϵ£¨Ã¿¸ö¹ØϵÓÐ700¸ö¾ä×Ó£©£¬Êý¾ÝÀ´Ô´ÎªÎ¬»ù°Ù¿Æ¡£ÆäÖÐ64¸ö¹ØϵÓÃÓÚѵÁ·£¬16¸öÓÃÓÚÑéÖ¤£¬ÒÔ¼°20¸öÓÃÓÚ²âÊÔ¡£²âÊÔÊý¾Ý¼¯°üº¬ 10,000 ¸ö¾ä×Ó£¬±ØÐëÔÚÏßÆÀ¹À¡£NYT-25Êý¾Ý¼¯ÊÇÓÉGao et al., 2019¡£DeepBlueAIÍŶÓËæ»ú³éÈ¡ 10 ¸ö¹ØϵÓÃÓÚѵÁ·£¬5 ¸öÓÃÓÚÑéÖ¤£¬10 ¸öÓÃÓÚ²âÊÔ¡£

ʵÑé½á¹ûÈçÏ£º
undefined

ÈçÉϱíËùʾ£¬ÔÚËùÓеÄʵÑéÉèÖÃÏ£¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÌá³öµÄMapRE£¬ÓÉÓÚÔÚԤѵÁ·ºÍ΢µ÷Öоù¿¼ÂÇÁËsupport setÑù±¾¾ä×Ӻ͹Øϵ±êÇ©ÐÅÏ¢£¬ÌṩÁËÎȶ¨µÄÐÔÄܱíÏÖ£¬²¢´ó·ùÓÅÓÚһϵÁÐbaseline·½·¨¡£½á¹ûÖ¤Ã÷ÁËÍŶÓÌá³öµÄ¿ò¼ÜµÄÓÐЧÐÔ£¬²¢±íÃ÷Á˹Øϵ³éÈ¡ÖйØϵ±êÇ©ÓïÒåÓ³ÉäÐÅÏ¢µÄÖØÒªÐÔ¡£


¿­·¢Óé·¢K8comDeepBlueAIÍŶӽøÒ»²½¿¼ÂÇÁ˵Í×ÊÔ´¹Øϵ³éÈ¡µÄ¼«¶ËÌõ¼þ£¬¼´ÁãÑù±¾µÄÇé¿ö¡£ÔÚ¸ÃÉ趨Ï£¬Ä£ÐÍÊäÈë²»°üº¬ÈκÎsupport setÑù±¾¡£ÔÚÁãÑù±¾Ìõ¼þÏ£¬ÒÔÉϴ󲿷ÖÉÙÑù±¾¹Øϵ³éÈ¡¿ò¼Ü²»ÊÊÓã¬ÒòΪÆäËü¸ÃÀàÄ£Ð͵Äÿ¸ö¹ØϵÀà±ðÖÐÖÁÉÙÐèÒªÓÐÒ»¸öÑù±¾¡£
undefined

½á¹û±íÃ÷£¬ÓëÆäËü×î½üÁãÑù±¾Ñ§Ï°¹¤×÷Ïà±È£¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÌá³öµÄMapREÔÚËùÓÐÉ趨϶¼»ñµÃÁ˳öÉ«µÄ±íÏÖ£¬Ö¤Ã÷ÁËMapREµÄÓÐЧÐÔ¡£



×ܽá


ÔÚÕâÏ×÷ÖУ¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÌá³öÁËÒ»ÖÖͬʱ¿¼ÂDZêÇ©ÐÅÏ¢ºÍÑù±¾ÐÅÏ¢µÄ¹Øϵ³éÈ¡Ä£ÐÍ£¬MapRE¡£´óÁ¿ÊµÑé½á¹û±íÃ÷£¬MapREÄ£ÐͶԼලÐÔ¹Øϵ³éÈ¡¡¢ÉÙÑù±¾¹Øϵ³éÈ¡ºÍÁãÑù±¾¹Øϵ³éÈ¡ÈÎÎñÖÐչʾÁ˳öÉ«µÄ±íÏÖ¡£½á¹û±íÃ÷Ñù±¾ºÍ±êÇ©ÐÅÏ¢Á½ÕßÔÚԤѵÁ·ºÍ΢µ÷Öж¼Æðµ½ÁËÖØÒª×÷Óá£ÔÚÕâÏ×÷ÖУ¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓûÓÐÑо¿ÁìÓòǨÒÆÔì³ÉµÄDZÔÚÓ°Ï죬ÎÒÃǽ«Ïà¹Ø·ÖÎö×÷ΪÏÂÒ»²½µÄ¹¤×÷¡£


×ÛÉÏ£¬¿­·¢Óé·¢K8comDeepBlueAIÍŶÓÌá³öµÄMapREÄ£ÐͽáºÏÁËÁãÑù±¾ºÍÉÙÑù±¾Ñ§Ï°µÄÌص㣬½áºÏÁËͬ¹ØϵÑù±¾ºÍ¹ØϵÓïÒåÁ½¸ö·½ÃæµÄÐÅÏ¢£¬Ä¿Ç°ÒÑÔÚ¿­·¢Óé·¢K8com¿Æ¼¼ÖÇÄÜÊý¾Ý±êעƽ̨Îı¾¹Øϵ³éÈ¡¹¦ÄÜÖеÃÒÔÓ¦Ó㬴ó·ùÌáÉýÁËÄ£ÐÍÔÚÉÙÁ¿ÑµÁ·Ñù±¾ÏµıíÏÖ£¬ÔÚÊý¾ÝµÄÖÇÄܱê×¢µÈÁìÓò¿É´ó·ù½ÚÊ¡ÈËÁ¦£¬ÌáÉý±êעЧÂʼ°±ê×¢ÖÊÁ¿¡£

Document

¿­·¢Óé·¢K8com