Big data storage and management in SaaS applications View Full Text


Ontology type: schema:ScholarlyArticle      Open Access: True


Article Info

DATE

2017-09-22

AUTHORS

Xi Zheng, Min Fu, Mohit Chugh

ABSTRACT

As an important service model for advanced computing, SaaS uses a defined protocol that manages services and applications. The popularity of advanced computing has reached a level that has led to the generation of large data sets, which is also called Big data. Big data is evolving with great velocity, large volumes, and great diversity. Such an amplification of data has brought into question the existing database tools in terms of their capabilities. Previously, storage and processing of data were simple tasks; however, it is now one of the biggest challenges in the industry. Experts are paying close attention to big data. Designing a system capable of storing and analyzing such data in order to extract meaningful information for decision-making is a priority. The Apache Hadoop, Spark, and NoSQL databases are some of the core technologies that are being used to solve these issues. This paper contributes to the solutions to the issues of big data storage and processing. It presents an analysis of the current technologies in the industry that could be useful in this context. Efforts have been focused on implementing a novel Trinity model, which is built using the lambda architecture with the following technologies: Hadoop, Spark, Kafka, and MongoDB. More... »

PAGES

18-29

References to SciGraph publications

Identifiers

URI

http://scigraph.springernature.com/pub.10.1007/s41650-017-0031-9

DOI

http://dx.doi.org/10.1007/s41650-017-0031-9

DIMENSIONS

https://app.dimensions.ai/details/publication/pub.1091891150


Indexing Status Check whether this publication has been indexed by Scopus and Web Of Science using the SN Indexing Status Tool
Incoming Citations Browse incoming citations for this publication using opencitations.net

JSON-LD is the canonical representation for SciGraph data.

TIP: You can open this SciGraph record using an external JSON-LD service: JSON-LD Playground Google SDTT

[
  {
    "@context": "https://springernature.github.io/scigraph/jsonld/sgcontext.json", 
    "about": [
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/08", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information and Computing Sciences", 
        "type": "DefinedTerm"
      }, 
      {
        "id": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/0806", 
        "inDefinedTermSet": "http://purl.org/au-research/vocabulary/anzsrc-for/2008/", 
        "name": "Information Systems", 
        "type": "DefinedTerm"
      }
    ], 
    "author": [
      {
        "affiliation": {
          "alternateName": "Macquarie University, Macquarie Park, 2109, NSW, Australia", 
          "id": "http://www.grid.ac/institutes/grid.1004.5", 
          "name": [
            "Macquarie University, Macquarie Park, 2109, NSW, Australia"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Zheng", 
        "givenName": "Xi", 
        "id": "sg:person.07673312776.98", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07673312776.98"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Macquarie University, Macquarie Park, 2109, NSW, Australia", 
          "id": "http://www.grid.ac/institutes/grid.1004.5", 
          "name": [
            "Macquarie University, Macquarie Park, 2109, NSW, Australia"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Fu", 
        "givenName": "Min", 
        "id": "sg:person.016150437403.44", 
        "sameAs": [
          "https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016150437403.44"
        ], 
        "type": "Person"
      }, 
      {
        "affiliation": {
          "alternateName": "Deakin University, 3220, Geelong VIC, Australia", 
          "id": "http://www.grid.ac/institutes/grid.1021.2", 
          "name": [
            "Deakin University, 3220, Geelong VIC, Australia"
          ], 
          "type": "Organization"
        }, 
        "familyName": "Chugh", 
        "givenName": "Mohit", 
        "type": "Person"
      }
    ], 
    "citation": [
      {
        "id": "sg:pub.10.1186/2192-113x-2-22", 
        "sameAs": [
          "https://app.dimensions.ai/details/publication/pub.1043498333", 
          "https://doi.org/10.1186/2192-113x-2-22"
        ], 
        "type": "CreativeWork"
      }
    ], 
    "datePublished": "2017-09-22", 
    "datePublishedReg": "2017-09-22", 
    "description": "As an important service model for advanced computing, SaaS uses a defined protocol that manages services and applications. The popularity of advanced computing has reached a level that has led to the generation of large data sets, which is also called Big data. Big data is evolving with great velocity, large volumes, and great diversity. Such an amplification of data has brought into question the existing database tools in terms of their capabilities. Previously, storage and processing of data were simple tasks; however, it is now one of the biggest challenges in the industry. Experts are paying close attention to big data. Designing a system capable of storing and analyzing such data in order to extract meaningful information for decision-making is a priority. The Apache Hadoop, Spark, and NoSQL databases are some of the core technologies that are being used to solve these issues. This paper contributes to the solutions to the issues of big data storage and processing. It presents an analysis of the current technologies in the industry that could be useful in this context. Efforts have been focused on implementing a novel Trinity model, which is built using the lambda architecture with the following technologies: Hadoop, Spark, Kafka, and MongoDB.", 
    "genre": "article", 
    "id": "sg:pub.10.1007/s41650-017-0031-9", 
    "isAccessibleForFree": true, 
    "isPartOf": [
      {
        "id": "sg:journal.1290433", 
        "issn": [
          "2096-1081", 
          "2509-3312"
        ], 
        "name": "Journal of Communications and Information Networks", 
        "publisher": "Institute of Electrical and Electronics Engineers (IEEE)", 
        "type": "Periodical"
      }, 
      {
        "issueNumber": "3", 
        "type": "PublicationIssue"
      }, 
      {
        "type": "PublicationVolume", 
        "volumeNumber": "2"
      }
    ], 
    "keywords": [
      "big data storage", 
      "big data", 
      "advanced computing", 
      "data storage", 
      "processing of data", 
      "large data sets", 
      "Lambda architecture", 
      "SaaS applications", 
      "NoSQL databases", 
      "Apache Hadoop", 
      "database tools", 
      "core technology", 
      "service model", 
      "meaningful information", 
      "Hadoop", 
      "computing", 
      "simple task", 
      "data sets", 
      "big challenge", 
      "such data", 
      "large volumes", 
      "current technology", 
      "Spark", 
      "technology", 
      "trinity model", 
      "processing", 
      "architecture", 
      "applications", 
      "task", 
      "storage", 
      "Kafka", 
      "issues", 
      "SAA", 
      "popularity", 
      "services", 
      "data", 
      "capability", 
      "database", 
      "information", 
      "experts", 
      "set", 
      "industry", 
      "model", 
      "protocol", 
      "tool", 
      "challenges", 
      "system", 
      "solution", 
      "context", 
      "management", 
      "order", 
      "efforts", 
      "generation", 
      "priority", 
      "terms", 
      "attention", 
      "close attention", 
      "greater velocity", 
      "analysis", 
      "questions", 
      "diversity", 
      "volume", 
      "great diversity", 
      "levels", 
      "velocity", 
      "amplification", 
      "paper"
    ], 
    "name": "Big data storage and management in SaaS applications", 
    "pagination": "18-29", 
    "productId": [
      {
        "name": "dimensions_id", 
        "type": "PropertyValue", 
        "value": [
          "pub.1091891150"
        ]
      }, 
      {
        "name": "doi", 
        "type": "PropertyValue", 
        "value": [
          "10.1007/s41650-017-0031-9"
        ]
      }
    ], 
    "sameAs": [
      "https://doi.org/10.1007/s41650-017-0031-9", 
      "https://app.dimensions.ai/details/publication/pub.1091891150"
    ], 
    "sdDataset": "articles", 
    "sdDatePublished": "2022-08-04T17:06", 
    "sdLicense": "https://scigraph.springernature.com/explorer/license/", 
    "sdPublisher": {
      "name": "Springer Nature - SN SciGraph project", 
      "type": "Organization"
    }, 
    "sdSource": "s3://com-springernature-scigraph/baseset/20220804/entities/gbq_results/article/article_744.jsonl", 
    "type": "ScholarlyArticle", 
    "url": "https://doi.org/10.1007/s41650-017-0031-9"
  }
]
 

Download the RDF metadata as:  json-ld nt turtle xml License info

HOW TO GET THIS DATA PROGRAMMATICALLY:

JSON-LD is a popular format for linked data which is fully compatible with JSON.

curl -H 'Accept: application/ld+json' 'https://scigraph.springernature.com/pub.10.1007/s41650-017-0031-9'

N-Triples is a line-based linked data format ideal for batch operations.

curl -H 'Accept: application/n-triples' 'https://scigraph.springernature.com/pub.10.1007/s41650-017-0031-9'

Turtle is a human-readable linked data format.

curl -H 'Accept: text/turtle' 'https://scigraph.springernature.com/pub.10.1007/s41650-017-0031-9'

RDF/XML is a standard XML format for linked data.

curl -H 'Accept: application/rdf+xml' 'https://scigraph.springernature.com/pub.10.1007/s41650-017-0031-9'


 

This table displays all metadata directly associated to this object as RDF triples.

144 TRIPLES      21 PREDICATES      92 URIs      83 LITERALS      6 BLANK NODES

Subject Predicate Object
1 sg:pub.10.1007/s41650-017-0031-9 schema:about anzsrc-for:08
2 anzsrc-for:0806
3 schema:author N6db6d4ea5d344b328871d4c3d1c1b3ff
4 schema:citation sg:pub.10.1186/2192-113x-2-22
5 schema:datePublished 2017-09-22
6 schema:datePublishedReg 2017-09-22
7 schema:description As an important service model for advanced computing, SaaS uses a defined protocol that manages services and applications. The popularity of advanced computing has reached a level that has led to the generation of large data sets, which is also called Big data. Big data is evolving with great velocity, large volumes, and great diversity. Such an amplification of data has brought into question the existing database tools in terms of their capabilities. Previously, storage and processing of data were simple tasks; however, it is now one of the biggest challenges in the industry. Experts are paying close attention to big data. Designing a system capable of storing and analyzing such data in order to extract meaningful information for decision-making is a priority. The Apache Hadoop, Spark, and NoSQL databases are some of the core technologies that are being used to solve these issues. This paper contributes to the solutions to the issues of big data storage and processing. It presents an analysis of the current technologies in the industry that could be useful in this context. Efforts have been focused on implementing a novel Trinity model, which is built using the lambda architecture with the following technologies: Hadoop, Spark, Kafka, and MongoDB.
8 schema:genre article
9 schema:isAccessibleForFree true
10 schema:isPartOf N706be1c57b844449bca1f6e9c5657293
11 N80bb7c44cd20496eb73b0e93617d4d7b
12 sg:journal.1290433
13 schema:keywords Apache Hadoop
14 Hadoop
15 Kafka
16 Lambda architecture
17 NoSQL databases
18 SAA
19 SaaS applications
20 Spark
21 advanced computing
22 amplification
23 analysis
24 applications
25 architecture
26 attention
27 big challenge
28 big data
29 big data storage
30 capability
31 challenges
32 close attention
33 computing
34 context
35 core technology
36 current technology
37 data
38 data sets
39 data storage
40 database
41 database tools
42 diversity
43 efforts
44 experts
45 generation
46 great diversity
47 greater velocity
48 industry
49 information
50 issues
51 large data sets
52 large volumes
53 levels
54 management
55 meaningful information
56 model
57 order
58 paper
59 popularity
60 priority
61 processing
62 processing of data
63 protocol
64 questions
65 service model
66 services
67 set
68 simple task
69 solution
70 storage
71 such data
72 system
73 task
74 technology
75 terms
76 tool
77 trinity model
78 velocity
79 volume
80 schema:name Big data storage and management in SaaS applications
81 schema:pagination 18-29
82 schema:productId N008a8a7385a145b5ab86a17798809897
83 Neab7f1e976a0428abd1564bf69be7d27
84 schema:sameAs https://app.dimensions.ai/details/publication/pub.1091891150
85 https://doi.org/10.1007/s41650-017-0031-9
86 schema:sdDatePublished 2022-08-04T17:06
87 schema:sdLicense https://scigraph.springernature.com/explorer/license/
88 schema:sdPublisher N22aa37dde60e4de19d349aea3f892073
89 schema:url https://doi.org/10.1007/s41650-017-0031-9
90 sgo:license sg:explorer/license/
91 sgo:sdDataset articles
92 rdf:type schema:ScholarlyArticle
93 N008a8a7385a145b5ab86a17798809897 schema:name dimensions_id
94 schema:value pub.1091891150
95 rdf:type schema:PropertyValue
96 N22aa37dde60e4de19d349aea3f892073 schema:name Springer Nature - SN SciGraph project
97 rdf:type schema:Organization
98 N6db6d4ea5d344b328871d4c3d1c1b3ff rdf:first sg:person.07673312776.98
99 rdf:rest N8af9a6cc775144bfa478e98bb6cfb28c
100 N706be1c57b844449bca1f6e9c5657293 schema:issueNumber 3
101 rdf:type schema:PublicationIssue
102 N80bb7c44cd20496eb73b0e93617d4d7b schema:volumeNumber 2
103 rdf:type schema:PublicationVolume
104 N8af9a6cc775144bfa478e98bb6cfb28c rdf:first sg:person.016150437403.44
105 rdf:rest Nc3bcb16cb2bb443a8f20d030045d4452
106 Nc3bcb16cb2bb443a8f20d030045d4452 rdf:first Nec8f959be2a04e04bc0904fd80401916
107 rdf:rest rdf:nil
108 Neab7f1e976a0428abd1564bf69be7d27 schema:name doi
109 schema:value 10.1007/s41650-017-0031-9
110 rdf:type schema:PropertyValue
111 Nec8f959be2a04e04bc0904fd80401916 schema:affiliation grid-institutes:grid.1021.2
112 schema:familyName Chugh
113 schema:givenName Mohit
114 rdf:type schema:Person
115 anzsrc-for:08 schema:inDefinedTermSet anzsrc-for:
116 schema:name Information and Computing Sciences
117 rdf:type schema:DefinedTerm
118 anzsrc-for:0806 schema:inDefinedTermSet anzsrc-for:
119 schema:name Information Systems
120 rdf:type schema:DefinedTerm
121 sg:journal.1290433 schema:issn 2096-1081
122 2509-3312
123 schema:name Journal of Communications and Information Networks
124 schema:publisher Institute of Electrical and Electronics Engineers (IEEE)
125 rdf:type schema:Periodical
126 sg:person.016150437403.44 schema:affiliation grid-institutes:grid.1004.5
127 schema:familyName Fu
128 schema:givenName Min
129 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.016150437403.44
130 rdf:type schema:Person
131 sg:person.07673312776.98 schema:affiliation grid-institutes:grid.1004.5
132 schema:familyName Zheng
133 schema:givenName Xi
134 schema:sameAs https://app.dimensions.ai/discover/publication?and_facet_researcher=ur.07673312776.98
135 rdf:type schema:Person
136 sg:pub.10.1186/2192-113x-2-22 schema:sameAs https://app.dimensions.ai/details/publication/pub.1043498333
137 https://doi.org/10.1186/2192-113x-2-22
138 rdf:type schema:CreativeWork
139 grid-institutes:grid.1004.5 schema:alternateName Macquarie University, Macquarie Park, 2109, NSW, Australia
140 schema:name Macquarie University, Macquarie Park, 2109, NSW, Australia
141 rdf:type schema:Organization
142 grid-institutes:grid.1021.2 schema:alternateName Deakin University, 3220, Geelong VIC, Australia
143 schema:name Deakin University, 3220, Geelong VIC, Australia
144 rdf:type schema:Organization
 




Preview window. Press ESC to close (or click here)


...