Most Viewed

  • SUN Yusheng, ZENG Junhao
    Scientific Information Research. 2024, 6(4): 11-24. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.002
    [Purpose/significance]The article reveals the theoretical systems, technological systems, and applied systems of vector databases, aiming to promote innovation in the research and practice of multimodal AI related theories, technologies, and applications. [Method/process]This article elaborates on the evolution of vector databases and defines their core concepts through literature tracing and content analysis. Subsequently, it compares and analyzes their characteristics and values, and based on this, sorts out their application mechanisms, functions, corresponding key technologies and application modes. It also discusses the challenges faced by vector databases and the corresponding countermeasures, and looks forward to their development trends from theoretical, technical, and application perspectives. [Result/conclusion]Vector databases originate from the construction of the vector index method system, develop in vector data retrieval engine construction, and mature in vector database management system construction. Compared to relational databases and graph databases, vector databases exhibit distinct characteristics in data models and indexing mechanisms. They hold various value for users, data managers, developers and researchers. The key technologies are divided into three categories: vector data embedding generation, vector data indexing, and vector data retrieval. Application patterns can be divided into three categories: data-driven applications, knowledge-driven applications and scenario-driven applications. Challenges exist in various aspects such as high-quality generation, semantic description, storage resource utilization, collaborative sharing, and ethical security of vector data. Trends point towards the systematization of theoretical frameworks, maturation of technical solutions, and ecosystem development of application services.
  • ZHANG Jianan, YU Hong, WANG Yanfei, ZHANG Yuxiang, WANG Yanhua, JIANG Xun
    Scientific Information Research. 2024, 6(4): 1-10. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.001
    [Purpose/significance]The most prominent feature of the new quality productivity is scientific and technological innovation. In the new era, intelligence plays a new role of eyes and ears, spearhead and staff, and also plays a new role of guidance. This paper analyzes the intelligent revolution of new quality productivity from the perspective of information science, and probes into its challenges and opportunities to traditional intelligence work. [Method/process]New quality productivity takes data as the core element, forms wisdom based on data, enables scientific and technological innovation, and promotes social and economic development. Intelligence work needs to adapt to new productivity and improve efficiency and quality through technological innovations such as intelligent intelligence collection and knowledge graph application. [Result/conclusion]The paper emphasizes the necessity of building a "production relationship" that matches the new productivity, including strengthening organizational management, personnel training and technological innovation. Through the intelligent promotion of new quality productivity in various fields of intelligence, it can promote the high-quality development of social economy.
  • ZHANG Meng, MU Dongmei, WANG Ping, YU Haitao, WANG Shutong, ZHANG Xinyue
    Scientific Information Research. 2024, 6(4): 81-95. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.007
    [Purpose/significance]By summarizing the research results of international medical data sharing, this paper provides a reference for related theoretical research and sharing practice. [Method/process]CNKI and Web of Science databases were used as data sources, and the visualization software CiteSpace was used to draw keyword clustering maps, count high-frequency keywords and cluster label words, and analyze the macro trend of research results in the field of international medical data sharing. The text content analysis method was then used to carry out a systematic analysis of the research hot topics. [Result/conclusion]The research hot topics in the field of medical data sharing can be summarized into five aspects: medical data sharing mode, stakeholders, platform technology, influencing factors, and incentive mechanism. Blockchain technology is widely used in research on medical data sharing and plays an important role in key issues such as platform technology and incentive mechanisms. However, the practice of medical data sharing in China is still in the initial stage, and it is necessary to strengthen the theoretical research of medical data sharing, enrich the investigation forms and research paths, and improve the data sharing policies and regulations to promote the practice of medical data sharing.
  • WEI Ruibin, WANG Yidan, XU Yan
    Scientific Information Research. 2025, 7(1): 41-52. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.004
    [Purpose/significance]The main path analysis of citation networks can be used to identify important literature in specific fields and can achieve the extraction of mainstream research threads. This paper will use the main path analysis method to analyze the research path of knowledge graphs and sort out the context of their research development. [Method/process]This paper firstly obtains research papers in the field of knowledge graphs from the Web of Science platform, then uses the HistCite software to generate a direct citation network of the literature, and then imports the data into Pajek to generate multiple main paths of the dataset, and combines the content of the papers on the main paths for qualitative analysis. [Result/conclusion]Through main path analysis, some main paths in the field of knowledge graphs can be quickly identified, such as the construction of knowledge graph, research on the application of knowledge graphs in recommendation and question answering and other application scenarios, research on the application of knowledge graphs in specific application fields such as manufacturing. These paths reflect the development context and research direction of knowledge graph technology. Review studies have played an important role in the development of the knowledge.
  • CHENG Yanlei, ZHANG Linxuan, ZHANG Xu
    Scientific Information Research. 2024, 6(3): 26-40. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.003
    [Purpose/significance]Digital rule of law government is a new paradigm of government governance that is compatible with digital China and digital society. Data elements have become an important driving force for high-quality economic development in the digital era, and public data openness is an important issue in the construction of digital rule of law government. [Method/process]This paper takes the regulations, policies and technical standards of the central government and 10 provinces, including Zhejiang, Shandong, Guizhou, Guangdong, Sichuan, Fujian, Guangxi, Hainan, Jiangxi and Jiangsu, as research samples to examine the current system of regulations and policies on the opening up of public data in China, and to review the problems in the opening up of data on this basis. [Result/conclusion]The study finds that there are problems in China's public data opening such as uncertainty in the connotation boundaries, gaps in the regulatory policy system, lack of supply of administrative rule of law system, and immaturity in the mechanism for safeguarding citizens' data rights. In this regard, the concept of public data and the nature of its ownership should be clearly defined, the regulatory policy system of public data openness should be improved, the administrative rule of law order of public data openness should be constructed, and the institutional mechanism of guaranteeing citizens' data rights should be improved, so as to integrate the opening up of public data into the track of the rule of law.
  • ZHOU Xiaoying, PEI Junliang
    Scientific Information Research. 2024, 6(3): 10-25. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.002
    [Purpose/significance] This paper aims to analyze the characteristics presented by Chinese information science research during the forty years between 1979-2019, so as to form an objective understanding of the development of Chinese information science. [Method/process]This paper collects 46 252 journal articles on information science and intelligence work included in the China National Knowledge Infrastructure (CNKI), takes their titles, abstracts and keywords as the corpus, and identifies the formation of 26 research themes in Chinese information science through the construction of the LDA theme model, and based on this data, and combining with the background knowledge of the development of Chinese information science, analyzes the characteristics and progress of Chinese information science research in the light of the four aspects of these research papers: the content of the study, the research targets, the characteristics embodied in the research results, and the research of information science itself. [Result/conclusion] From the perspective of content, papers in Chinese information science journals can be summarized as six major research topics; From the perspective of objects, the term information remains the most core concept in Chinese information research, with literature, journals, and papers being important research objects, and knowledge and data gradually being incorporated into the research scope; From the perspective of achievements, significant achievements have been made in literature analysis and research, information analysis and data analysis, knowledge organization and expression; and evaluation, core screening, and hotspot identification are the main application forms of Chinese information research achievements; From the perspective of research on the discipline itself, Chinese information science needs to pay more attention to theoretical practice and education.
  • BU Wenru, WANG Hao, LI Xiaomin, ZHOU Shu, DENG Sanhong
    Scientific Information Research. 2024, 6(4): 37-52. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.004
    [Purpose/significance]Allusions, as an important and widely used rhetorical device in literary creation, hold immeasurable value for the study of ancient Chinese literature. Despite this, the automatic identification technology for allusions is not yet mature and currently relies mainly on manual identification, which requires further in-depth research. [Method/process]The article proposes an allusion citation recognition method that incorporates the function of making corrections using large language models at the decision-making level. This method combines traditional sequence labeling techniques with general large language models, introduces prompt templates, and performs output fusion at the decision layer to improve accuracy. In addition, this study also constructs a set of evaluation metrics specifically for the problem of allusion identification. [Result/conclusion]Through generalization testing, the AR_BBC_LP allusion identification model performed excellently in the experiment, with P_allu, R_allu, and F1_allu reaching 89.75%, 89.38%, and 89.56% respectively, significantly better than existing baseline models. The results show that the model not only enhances the performance of traditional sequence labeling models but also opens up new areas for the application of large language models. It also provides a new perspective and strong methodological support for the identification of allusions and their application in the study of ancient Chinese literature.
  • SU Zhen, LV Jie, ZHAO Wenyan
    Scientific Information Research. 2024, 6(4): 25-36. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.003
    [Purpose/significance]By reviewing the status and hotspots of blockchain research in the library and information discipline, the paper sums up the research achievements and problems, and provides references for the future development of blockchain in the field of library and information. [Method/process]This paper takes CNKI and CSSCI as data sources, and analyzes blockchain research in the library and information discipline in terms of the number of published papers, authors, institutions, keyword bursts, and cited references. [Result/conclusion]The research shows that blockchain research in the library and information field peaked in 2021 and developed together with other disciplines. However, some problems remain, such as the decline in the number of papers, the instability of research teams, the lack of cooperation, and the lack of high-quality papers. Based on these problems, the paper proposes prospects in terms of research content, research methods, cooperation among authors and institutions, and the integrated development of multiple ideas.
  • WANG Dakun, HUA Bolin
    Scientific Information Research. 2025, 7(1): 131-140. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.012
    [Purpose/significance]Identifying and foreseeing emerging technologies can bring technological first-mover advantages to enterprises and governments and help them grasp technological development trends in a timely manner. [Method/process]This study uses the BERTopic topic modeling method to obtain the domain topic distribution, and merges paper and patent topics based on the cosine similarity of topic vectors to identify emerging topics. [Result/conclusion]Using the BERTopic topic modeling method combined with index evaluation can effectively identify emerging topics and emerging terms. Empirical research was carried out taking the field of new energy vehicles as an example. Using two verification methods, a divided verification period and data verification, 12 of the 16 identified topics passed the verification, which supports the effectiveness of this research method.
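    The merging step described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the topic names, the three-dimensional topic vectors, the 0.8 threshold, and the `merge_topics` helper are all hypothetical, and in practice the vectors would come from a fitted topic model.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def merge_topics(paper_topics, patent_topics, threshold=0.8):
    """Pair each paper topic with its most similar patent topic.

    A pair is kept only when the similarity reaches the threshold.
    The threshold value is an assumption chosen for illustration.
    """
    merged = []
    for p_name, p_vec in paper_topics.items():
        best_name, best_sim = None, -1.0
        for t_name, t_vec in patent_topics.items():
            sim = cosine(p_vec, t_vec)
            if sim > best_sim:
                best_name, best_sim = t_name, sim
        if best_name is not None and best_sim >= threshold:
            merged.append((p_name, best_name, round(best_sim, 3)))
    return merged


# Hypothetical topic vectors, invented for this example.
paper_topics = {
    "battery_materials": [0.9, 0.1, 0.0],
    "autonomous_driving": [0.0, 0.2, 0.9],
}
patent_topics = {
    "solid_state_battery": [0.8, 0.2, 0.1],
    "charging_station": [0.1, 0.9, 0.1],
}

pairs = merge_topics(paper_topics, patent_topics, threshold=0.8)
print(pairs)
```

    Topics left unpaired on either side remain separate candidates; how such topics are scored as "emerging" depends on the index evaluation, which the abstract does not detail.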
  • YAO Zhanlei, LI Jinxuan, XU Xin
    Scientific Information Research. 2024, 6(4): 53-65. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.005
    [Purpose/significance]The traditional occupational classification system is an important tool for labor market observation, but its inherent characteristics, such as relative robustness, make it difficult to reflect changes in industrial talent demand in a timely manner. To solve this problem, this paper focuses on how to use digital means to map the relationship between occupations and skill requirements, and how to construct a multi-layer vocation-skill network and how to realize it. [Method/process]Firstly, a set of cross-job descriptors, a common language for describing different jobs in the Chinese context, is designed in order to establish a vocation description framework for recruitment data development and utilization. Secondly, the key technology of intelligent occupational classification based on job description text is established, and a set of methods and models is then formed for constructing a multi-layer vocation-skill network that reflects the gradual specialization of skill requirements. Finally, the method is validated by taking the Shanghai IC industry as an example. [Result/conclusion]The experimental results show that the multi-layer vocation-skill network, which originates from the actual employment activities of enterprises, can assist in observing diverse industrial talent needs with multi-dimensional granularity and flexibility.
  • WANG Yanfei
    Scientific Information Research. 2024, 6(3): 1-9. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.001
    [Purpose/significance]Integrity and innovation are important issues that the Institute of Science and Technology Intelligence needs to grasp. [Method/process]Regarding innovative issues based on adhering to principles and starting from historical facts, this article analyzes the role and significance of WIKID memes in scientific and technological intelligence study, explores relevant professional operational standards, proposes to strengthen the ability to compile WIKID memes under strategic concerns, implements intelligence thematic research based on dynamic clue discovery, and carefully examines the norms of meme characterization in perception scanning. [Result/conclusion]These are necessary actions for scientific and technological intelligence professionals and professional institutions to respond to the challenges of the times.
  • HUA Bolin, WANG Yingze
    Scientific Information Research. 2025, 7(1): 53-64. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.005
    [Purpose/significance]With the strong ability to process large-scale datasets and outstanding performance in various natural language processing tasks, large language models (LLMs) have excelled across multiple industries. Since scientific and technical intelligence primarily relies on textual data, LLMs are naturally well-suited for this field, ushering in a new wave of transformative changes. [Method/process]This article discusses the advantages of LLMs from five perspectives: low-dimensional dense vector representations of text, large-scale pre-trained models, fine-tuning and prompt learning, high-quality large-scale training data, and human alignment techniques. [Result/conclusion]LLMs have extensive applications in tasks such as intelligence identification, intelligence tracking, intelligence evaluation, and intelligence prediction, resulting in significant optimization improvements or paradigm shifts.
  • HAO Wenke, YANG Jianlin, MIAO Lei
    Scientific Information Research. 2024, 6(4): 96-113. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.008
    [Purpose/significance]By constructing and applying the multi-dimensional portrait system of green technology innovation enterprises in China, this paper aims to comprehensively understand the status quo, advantages and obstacles of enterprises in specific fields in green technology innovation, so as to provide scientific references and suggestions for relevant government departments and decision makers of enterprises. [Method/process]Based on the resource-based view and environmental dependence theory, this paper selects the internal and external labels of enterprises, and designs a multi-dimensional label system to objectively describe the performance of enterprises in terms of profitability, scientific research and innovation, public opinion and environmental responsibility. Then, the green technology innovation efficiency index system is constructed from the input-output stage. The undirected super-efficiency SBM model is used to calculate the green technology innovation efficiency value of enterprises in each year, and the dynamic change trend of enterprise efficiency value is analyzed by the Malmquist index. [Result/conclusion]This paper takes the field of green transportation as an example, verifies that the proposed method can effectively evaluate the green technology innovation ability of listed enterprises, and analyzes the reasons for the change of technical efficiency. In addition, through the clustering of financial strength, green innovation ability and social public opinion, five types of green transportation enterprises are obtained, which provides effective guidance for different types of enterprises to formulate differentiated competitive strategies.
  • DENG Sanhong, ZHANG Yiqin, WANG Hao
    Scientific Information Research. 2025, 7(1): 1. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.001
    [Purpose/significance]This paper explores the guiding role of Xi Jinping Thought on Socialism with Chinese Characteristics for a New Era in the development of the Information Resource Management discipline with Chinese characteristics, providing significant insights for the innovative advancement of China's Information Resource Management discipline and strengthening the discourse power of Chinese social sciences. [Method/process]This paper systematically reviews the core elements of the development philosophy of the Information Resource Management discipline within Xi Jinping Thought on Socialism with Chinese Characteristics for a New Era from a holistic perspective, elucidates the logical system of the development of the discipline from the diverse perspectives of national strategy integration, innovative development paths, and discourse system construction, and summarizes the value implications from both theoretical and practical viewpoints. Based on research papers published in core journals of the Information Resource Management discipline and related policy texts, this paper employs topic mining techniques to analyze the development overview and thematic evolution features of the research in the Information Resource Management field from a concrete perspective. [Result/conclusion]The results indicate that Xi Jinping Thought on Socialism with Chinese Characteristics for a New Era has provided guidance for theoretical innovation and interdisciplinary integration in the field of Information Resource Management. It has strengthened the discipline's role in supporting and responding to national strategies, offering both theoretical foundations and practical references for the high-quality development of philosophy and social sciences with Chinese characteristics.
  • ZHANG Bin
    Scientific Information Research. 2024, 6(3): 83-97. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.007
    [Purpose/significance]Analyzing the laws, regulations and policies related to security intelligence in China can not only provide reference for decision-making, but also effectively enrich the connotation of China's overall national security concept. [Method/process]Using the LDA topic model, text mining was conducted on laws, regulations and policies related to security intelligence in China, and theme words were extracted from them. At the same time, based on the selection of policy indicators by existing scholars, scientifically select and design evaluation indicators for China's security intelligence laws, regulations, and policies. Referring to the overall national security concept, several representative policy contents were selected for analysis based on the PMC index model. [Result/conclusion]According to the random sampling method, 12 policy texts were selected, among which about 83.3% of China's security intelligence related laws, regulations and policies have acceptable consistency, especially the relevant laws and regulations formulated by the Standing Committee of the National People's Congress have high consistency and are acceptable. At the same time, the current laws and regulations related to security intelligence in China mainly focus on traditional security fields, with less attention paid to non-traditional security fields; The use of relevant policy tools is relatively reasonable; The current policy advantage lies in the fact that the policies implemented in the traditional security field can play a good role, while the weakness lies in the lack of a vertical network of security intelligence related laws, regulations and policies that cover various fields horizontally. This paper provides new tools and perspectives for analyzing policy texts, but further optimization can be made in indicator selection and assignment to enhance the credibility of conclusions.
  • HE Bingcheng, YANG Guoli
    Scientific Information Research. 2024, 6(3): 98-110. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.008
    [Purpose/significance]This paper focuses on the viewpoints of leading American think tanks on chip competition with China, aiming to unravel the logic behind the formulation of U.S. chip policies towards China. It also seeks to explore innovative pathways for the development of China's chip industry and provide strategic recommendations beneficial to Sino-American chip competition. [Method/process]The study selects 20 representative research reports from 9 prominent think tanks and employs a literature analysis approach to dissect the viewpoints, motives, and potential influences found in these reports. [Result/conclusion]The results reveal that most American think tanks adopt a firm stance, perceiving the rise of China's chip industry as a threat. Calls for sanctions against Chinese chips dominate the mainstream discourse. In response, China should strengthen its own innovation strategy and actively address the situation through avenues such as deepening Sino-American dialogues for understanding and seeking international multilateral cooperation.
  • WANG Yuefen, DONG Xiaoyi, HE Jin
    Scientific Information Research. 2024, 6(3): 56-68. https://doi.org/10.19809/j.cnki.kjqbyj.2024.03.005
    [Purpose/significance]Faced with the challenges of big data and artificial intelligence technology, knowledge services are undergoing profound changes. Starting from the task scenarios of user needs, this paper explores the scenario-based intelligence service process that supports and matches different industries and their business scenarios. [Method/process]This paper takes the scenario-based intelligence service R-S model as the core, builds a process framework for generating and evaluating scenario-based intelligence service solutions based on multi-source data aggregation, and briefly describes the main contents and operation of multi-source data aggregation. Taking an organization's "Russia-Ukraine conflict" equipment information quick perception intelligence service as an example, it discusses in detail the operation process and main contents, from scenario service demand analysis and service element scenario through service solution generation to service solution evaluation. [Result/conclusion]Scene elements are identified and formalized, low-cost computational methods are used, and human-computer interaction is then simulated to establish a configuration response relationship, generate service solutions, and optimize them through evaluation. The proposed process for generating and evaluating scenario-based intelligence service solutions can meet the needs of service users in task scenarios.
  • DOU Luyao, HUANG Yuncong, BAI Zengliang, LI Yi, ZHOU Zhigang
    Scientific Information Research. 2024, 6(4): 114-126. https://doi.org/10.19809/j.cnki.kjqbyj.2024.04.009
    [Purpose/significance]Clarifying the professional skill demand and change trend of high-tech enterprises from the perspective of supply and demand optimization helps to accelerate the aggregation of innovation elements, gradually establish technical groups and establish a sound cooperation system. [Method/process]Based on the resume information of incumbents in the Integrated Circuit (IC) field, the collaborative filtering recommendation algorithm in the time dimension was used to predict and evaluate the relationship between enterprises and professional skills in the field. By combining the Word2vec word vector model and the LDA model, data mining and corpus expansion of resume information were carried out to identify skill topics. The evolution path of skill themes was then described and visualized to clarify the development status and evolution law of skill themes in the IC field. [Result/conclusion]The research shows that the collaborative filtering idea based on the time dimension is suitable for the prediction of the relationship between enterprises and skills, with an average accuracy of 96.68%. The evolution of skill themes in the IC field shows the trend of "aggregation→dispersion→reaggregation", and the evolution law of each skill theme path is different, which is embodied in spiral cross evolution, innovation iterative evolution and technology dependency evolution. The overall skill development trend and path change law of the IC industry given by the study can provide more accurate retrieval methods and skill selection directions for high-tech enterprises and incumbents.
  • HAO Jiayi, WANG Yuzhuo, ZHANG Chengzhi
    Scientific Information Research. 2025, 7(1): 16-29. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.002
    [Purpose/significance]Research methods in information science are one of the critical research directions in this field. Constructing a fine-grained research method corpus and extracting research method entities can help scholars quickly understand the research methods in this field, explore the evolution of methods and their future development trends, and lay the foundation for the service and application of the research method corpus in the subsequent digital wave. [Method/process]Firstly, based on academic articles published in the Journal of the China Society for Scientific and Technical Information from 2000 to 2023, this study randomly selected 50 articles and manually annotated the research methodology entities within them, using these as the training corpus for entity extraction. Secondly, two models, BERT-base-chinese and Chinese-BERT-wwm-ext, were selected for entity extraction, and the model with superior performance was chosen as the final entity extraction model for this study. [Result/conclusion]This paper constructs a fine-grained research method annotation corpus of informatics that includes six types of entities: theoretical entity, method entity, dataset entity, indicator entity, tool entity, and other entities. In the task of training an entity extraction model based on manually annotated corpora, the Chinese-BERT-wwm-ext model performed better, with a precision, recall, and F1 score of 0.808 2, 0.846 7, and 0.827 0, respectively. Furthermore, this paper conducts an analysis of the research method entities and their categories, discovering that research methodologies in information science are becoming increasingly diverse, with emerging technologies coexisting alongside traditional methods, each showcasing their unique strengths.
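    The three scores reported above are related through the standard F1 formula, the harmonic mean of precision and recall. The short check below is only a sketch: it assumes the first reported figure (0.808 2) is precision, and the function name `f1_score` is chosen for illustration.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (the standard F1 definition)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Figures taken from the abstract; reading the first one as precision
# is an assumption made for this check.
reported_precision = 0.8082
reported_recall = 0.8467
reported_f1 = 0.8270

computed_f1 = f1_score(reported_precision, reported_recall)
print(f"computed F1 = {computed_f1:.4f}, reported F1 = {reported_f1:.4f}")
```

    Under that assumption, the computed value agrees with the reported F1 to four decimal places.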
  • LI Junhua, YUAN Qian, YAN Xiang, LV Changhong
    Scientific Information Research. 2025, 7(1): 95-108. https://doi.org/10.19809/j.cnki.kjqbyj.2025.01.009
    [Purpose/significance]This study aims to automatically generate claims using the GPT-4 model, in order to reduce the writing difficulty for inventors and improve work efficiency and quality. [Method/process]The article constructs Prompts suitable for automatically generating patent claims and implements four prompting strategies: Zero-Shot, Exact-Drafting, Stepwise-Claim, and Exact-Step Claim. By inputting patent specifications and technical disclosure documents into the GPT-4 model and using Prompts to guide its output, the automated generation of patent claims is achieved. The ROUGE and BERTScore evaluation metrics were used to assess the quality of the text, and the generated text was analyzed in comparison with the reference text from multiple dimensions, including the number of claims, text length, high-frequency words, keywords, and common collocations. Finally, the quality of the generated claim documents was evaluated through expert assessment in five aspects: clarity, consistency, relevance, professionalism, and completeness. [Result/conclusion]Empirical research shows that the Exact-Step Claim prompting strategy significantly improves the quality of generated claims; moreover, claims generated based on patent specifications are more closely matched in the number of claims and text length with the reference texts, indicating that the application effect of the GPT-4 model in the field of natural language understanding and generation is closely related to the quality of the input text. This study provides an efficient and intelligent assistance method that contributes to the development of the patent text writing and review field. However, challenges remain: the model needs further improvement to precisely understand complex technical terms and comply with patent regulations, and its ability to judge the number of claims and the length of the text still needs to be optimized.
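    As a rough illustration of what a ROUGE-style comparison between a generated claim and a reference claim measures, the sketch below computes unigram overlap (ROUGE-1). It is a simplified stand-in, not the evaluation code used in the study: whitespace tokenization is an assumption (Chinese text would need word segmentation first), the `rouge_1` helper is hypothetical, and the two example sentences are invented.

```python
from collections import Counter


def rouge_1(candidate, reference):
    """Unigram-overlap precision, recall and F1 between two texts.

    Tokens are lowercased and split on whitespace; this is a
    simplification compared with full ROUGE toolkits.
    """
    cand_tokens = candidate.lower().split()
    ref_tokens = reference.lower().split()
    # Clipped overlap: each token counts at most as often as it
    # appears in both texts.
    overlap = sum((Counter(cand_tokens) & Counter(ref_tokens)).values())
    precision = overlap / len(cand_tokens) if cand_tokens else 0.0
    recall = overlap / len(ref_tokens) if ref_tokens else 0.0
    if precision + recall == 0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return {"precision": precision, "recall": recall, "f1": f1}


# Invented example texts, not taken from the study's data.
reference = "a battery pack comprising a housing and a cooling plate"
candidate = "a battery pack comprising a housing and a heat sink"

scores = rouge_1(candidate, reference)
print(scores)
```

    Surface-overlap scores of this kind reward shared wording only, which is one reason the study pairs them with BERTScore and expert assessment.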